Thermophilic and thermoacidophilic biopolymer-degrading genes and enzymes from Alicyclobacillus acidocaldarius and related organisms, methods

ABSTRACT

Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 15/191,113, filed Jun. 23, 2016, pending, which is a continuation of U.S. patent application Ser. No. 14/727,653, filed Jun. 1, 2015, now U.S. Pat. No. 9,404,134, issued Aug. 2, 2016, which is a continuation of U.S. patent application Ser. No. 13/930,517, filed Jun. 28, 2013, now U.S. Pat. No. 9,045,741, issued Jun. 2, 2015, which is a continuation of U.S. patent application Ser. No. 12/927,504, filed Nov. 15, 2010, now U.S. Pat. No. 8,497,110, issued Jul. 30, 2013, which is a continuation-in-part of U.S. patent application Ser. No. 12/322,359, filed Jan. 29, 2009, now U.S. Pat. No. 7,858,353, issued Dec. 28, 2010, for “THERMOPHILIC AND THERMOACIDOPHILIC BIOPOLYMER-DEGRADING GENES AND ENZYMES FROM ALICYCLOBACILLUS ACIDOCALDARIUS AND RELATED ORGANISMS, METHODS,” which itself claims the benefit of the filing date of U.S. Provisional Patent Application Ser. No. 61/025,136, filed Jan. 31, 2008, of the same title, the disclosure of each of which is hereby incorporated herein in its entirety by this reference.

GOVERNMENT RIGHTS

This invention was made with government support under Contract Number DE-AC07-99ID13727 and Contract Number DE-AC07-05ID14517 awarded by the United States Department of Energy. The government has certain rights in the invention.

STATEMENT ACCORDING TO 37 C.F.R. § 1.821(c) or (e)—SEQUENCE LISTING SUBMITTED AS A TXT FILE

Pursuant to 37 C.F.R. § 1.821(c) or (e), a file containing an electronic version of the Sequence Listing has been submitted concomitant with this application, the contents of which are hereby incorporated by reference.

TECHNICAL FIELD

The present invention relates generally to biotechnology. More specifically, the present invention relates to isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and methods for their use.

BACKGROUND

Dilute acid hydrolysis to remove hemicellulose from lignocellulosic materials is one of the most developed pretreatment techniques for lignocellulose and is currently favored (Hamelinck et al., 2005) because it results in fairly high yields of xylose (75% to 90%). Conditions that are typically used range from 0.1 to 1.5% sulfuric acid and temperatures above 160° C. The high temperatures used result in significant levels of thermal decomposition products that inhibit subsequent microbial fermentations (Lavarack et al., 2002). High temperature hydrolysis requires pressurized systems, steam generation, and corrosion resistant materials in reactor construction due to the more corrosive nature of acid at elevated temperatures.

Low temperature acid hydrolyses are of interest because they have the potential to overcome several of the above shortcomings (Tsao et al., 1987). It has been demonstrated that 90% of hemicellulose can be solubilized as oligomers in a few hours of acid treatment in the temperature range of 80° C. to 100° C. It has also been demonstrated that the sugars produced in low temperature acid hydrolysis are stable under those same conditions for at least 24 hours with no detectable degradation to furfural decomposition products. Finally, sulfuric acid typically used in pretreatments is not as corrosive at lower temperatures. However, the use of lower temperature acid pretreatments requires much longer reaction times to achieve acceptable levels of hydrolysis. Although 90% hemicellulose solubilization has been shown (Tsao et al., 1987), the bulk of the sugars are in the form of oligomers rather than monomers. The organisms currently favored in subsequent fermentation steps cannot utilize sugar oligomers (Garrote et al., 2001), so the oligomer-containing hydrolysates require further processing to monomers, usually as a second acid or alkaline hydrolysis step (Garrote et al., 2001).

Other acidic pretreatment methods include autohydrolysis and hot water washing. In autohydrolysis, biomass is treated with steam at high temperatures (~240° C.), which cleaves acetyl side chains associated with hemicellulose to produce acetic acid that functions in a similar manner to sulfuric acid in acid hydrolysis. Higher pretreatment temperatures are required as compared to dilute acid hydrolysis because acetic acid is a much weaker acid than sulfuric acid. At temperatures below 240° C., the hemicellulose is not completely hydrolyzed to sugar monomers and has high levels of oligomers (Garrote et al., 2001). In hot water washing, biomass is contacted with water (under pressure) at elevated temperatures of 160° C. to 220° C. This process can effectively hydrolyze greater than 90% of the hemicellulose present, but the solubilized hemicellulose is typically over 95% in the form of oligomers (Liu and Wyman, 2003).

BRIEF SUMMARY OF THE INVENTION

Embodiments of the invention relate to purified and/or isolated nucleotide sequences of the genome of Alicyclobacillus acidocaldarius, or a homologue or fragment thereof. In one embodiment of the invention, the nucleotide sequence is selected from SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, 463 or a homologue or fragment thereof. In another embodiment of the invention, the homologue is selected from the group consisting of a nucleotide sequence having at least 80% sequence identity to SEQ ID NOs:1, 18, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, or 438; at least 93% sequence identity to SEQ ID NO:461; at least 94% sequence identity to SEQ ID NO:35; at least 96% sequence identity to SEQ ID NO:459; at least 99% sequence identity to SEQ ID NO:463; at least 99.6% sequence identity to SEQ ID NO:457; and at least 99.7% sequence identity to SEQ ID NO:455.

Embodiments of the invention may further relate to an isolated and/or purified nucleic acid sequence comprising a nucleic acid sequence encoding a polypeptide selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456.
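By way of illustration only, the polypeptide identity thresholds recited above can be collected into a lookup table. The following Python sketch transcribes the SEQ ID NOs and percentages from the preceding paragraph; the names MIN_IDENTITY and meets_threshold are hypothetical and are not part of the disclosure.

```python
# Minimum percent identity per polypeptide SEQ ID NO, transcribed from the
# paragraph above. The table layout and function name are illustrative
# assumptions, not part of the disclosure.
NINETY_PERCENT_IDS = (
    2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219,
    236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, 439,
)

MIN_IDENTITY = {seq_id: 90.0 for seq_id in NINETY_PERCENT_IDS}
MIN_IDENTITY.update(
    {462: 93.0, 36: 94.0, 460: 96.0, 464: 99.0, 458: 99.6, 456: 99.7}
)


def meets_threshold(seq_id: int, percent_identity: float) -> bool:
    """Return True if percent_identity reaches the minimum recited for seq_id."""
    if seq_id not in MIN_IDENTITY:
        raise KeyError(f"SEQ ID NO:{seq_id} is not a recited polypeptide sequence")
    return percent_identity >= MIN_IDENTITY[seq_id]
```

For example, under this sketch a candidate that is 99.6% identical to SEQ ID NO:456 would not meet the recited 99.7% minimum, whereas a candidate that is 90% identical to SEQ ID NO:2 would meet its minimum.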

Embodiments of the invention also relate to isolated and/or purified polypeptides encoded by a nucleotide sequence of the genome of Alicyclobacillus acidocaldarius, or a homologue or fragment thereof. In one embodiment, the nucleotide sequence is selected from the group consisting of a nucleotide sequence having at least 80% sequence identity to SEQ ID NOs:1, 18, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, or 438; at least 93% sequence identity to SEQ ID NO:461; at least 94% sequence identity to SEQ ID NO:35; at least 96% sequence identity to SEQ ID NO:459; at least 99% sequence identity to SEQ ID NO:463; at least 99.6% sequence identity to SEQ ID NO:457; and at least 99.7% sequence identity to SEQ ID NO:455.

In another embodiment of the invention, the nucleotide sequence is selected from SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, 463 or a homologue or fragment thereof. In still another embodiment, the polypeptide has the amino acid sequence of SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, 439, 456, 458, 460, 462, or 464. In yet another embodiment, the polypeptide is selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456.

In embodiments of the invention, the polypeptides may be acidophilic and/or thermophilic. In further embodiments, the polypeptides may be glycosylated, pegylated, and/or otherwise post-translationally modified.

Embodiments of the invention include methods of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. Such methods may comprise placing a polypeptide selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylan, glycoside, xylan-, glucan-, galactan-, and/or mannan-decorating group.

These and other aspects of the invention will become apparent to the skilled artisan in view of the teachings contained herein.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

FIGS. 1A and 1B depict a sequence alignment between SEQ ID NO:2 (RAAC00169), an esterase of the alpha-beta hydrolase superfamily, and gi:121533815, gi:89099582, gi:16078568, gi:15615150, and gi:124524344 (SEQ ID NOs:3-7, respectively) which are all esterases of the alpha-beta hydrolase superfamily. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 2A and 2B depict a sequence alignment between SEQ ID NO:19 (RAAC00501), an alpha beta hydrolase, and gi:125974699, gi:15613871, gi:5457696, gi:14520481, and gi:40744233 (SEQ ID NOs:20-24, respectively) which are all alpha beta hydrolases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 3A, 3B, and 3C depict a sequence alignment between SEQ ID NO:36 (RAAC00568), an alpha-glucosidase, and gi:6686567, gi:4586418, gi|89098051, and gi|114844717 (SEQ ID NOs:37-40, respectively) which are all alpha-glucosidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 4A, 4B, and 4C depict a sequence alignment between SEQ ID NO:52 (RAAC00594) and gi|16131527, gi|52081844, gi|52787233, gi|16504867, and gi|16422318 (SEQ ID NOs:53-57, respectively) which are all alpha-xylosidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 5A and 5B depict a sequence alignment between SEQ ID NO:69 (RAAC00602), an alpha-L-arabinofuranosidase, and gi:6079924, gi:89095985, gi:15614424, gi:52081375, and gi:52786751 (SEQ ID NOs:70-74, respectively) which are all alpha-L-arabinofuranosidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 6A and 6B depict a sequence alignment between SEQ ID NO:86 (RAAC00798), a cell wall-associated hydrolase, and gi|15893601, gi|15896196, gi|15893600, and gi|116513351 (SEQ ID NOs:87-90, respectively) which are all cell wall-associated hydrolases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 7A and 7B depict a sequence alignment between SEQ ID NO:102 (RAAC01076), an altronate hydrolase, and gi|15613053, gi|121533397, gi|52081816, gi|52787203, and gi|15893984 (SEQ ID NOs:103-107, respectively) which are all altronate hydrolases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 8A and 8B depict a sequence alignment between SEQ ID NO:119 (RAAC01219) and gi|125973125, gi|76796625, gi|20515428, gi|114843317, and gi|76795342 (SEQ ID NOs:120-124, respectively) which are all cellulase/endoglucanase Ms. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 9A and 9B depict a sequence alignment between SEQ ID NO:136 (RAAC01220) and gi|125973126, gi|20515429, gi|76796624, gi|114843316, and gi|15893508 (SEQ ID NOs:137-141, respectively) which are all cellulase/endoglucanase Ms. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIG. 10 depicts a sequence alignment between SEQ ID NO:153 (RAAC01221), a cellulase/endoglucanase M, and gi:20515430, gi:76796623, gi:125973127, and gi:125973126 (SEQ ID NOs:154-156 and 137, respectively) which are all cellulase/endoglucanase Ms. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 11A-11C depict a sequence alignment between SEQ ID NO:168 (RAAC01275), a polygalacturonase, and gi:89098529, gi:116623151, gi:116620373, gi:52081815, and gi:52787202 (SEQ ID NOs:169-173, respectively) which are all polygalacturonases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 12A-12C depict a sequence alignment between SEQ ID NO:185 (RAAC01615), an alpha-galactosidase, and gi|15614786, gi|90961985, gi|148544139, gi|76796346, and gi:114844315 (SEQ ID NOs:186-190, respectively) which are all alpha-galactosidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 13A-13K depict a sequence alignment between SEQ ID NO:202 (RAAC01621), a cellobiose phosphorylase, and gi|125973736, gi|114844102, gi|20517160, gi|76795700, and gi|118725340 (SEQ ID NOs:203-207, respectively) which are all cellobiose phosphorylases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 14A-14C depict a sequence alignment between SEQ ID NO:219 (RAAC01755) and gi|15616253, gi|89099466, gi|17227827, gi|72163378, and gi|13470878 (SEQ ID NOs:220-224, respectively) which are all glycogen debranching enzymes. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 15A and 15B depict a sequence alignment between SEQ ID NO:236 (RAAC01887), a cellulase/endoglucanase M, and gi|52081384, gi|124521982, gi|89098880, gi|121533826, and gi|15615819 (SEQ ID NOs:237-240, respectively) which are all cellulase/endoglucanase Ms. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 16A and 16B depict a sequence alignment between SEQ ID NO:253 (RAAC01897), an acetyl esterase/acetyl hydrolase, and gi|21221842, gi|13470513, gi|13471782, gi|16329563, and gi|15600577 (SEQ ID NOs:254-258, respectively) which are all acetyl esterase/acetyl hydrolases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 17A and 17B depict a sequence alignment between SEQ ID NO:270 (RAAC01917), a beta-1,4-xylanase, and gi|114054545, gi|134266943, gi|39654242, gi|61287936, and gi|3201483 (SEQ ID NOs:271-275, respectively) which are all beta-1,4-xylanases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 18A and 18B depict a sequence alignment between SEQ ID NO:287 (RAAC02404), a cinnamoyl ester hydrolase, and gi|76796576, gi|114845181, gi|15896898, gi|15806073, and gi|58448090 (SEQ ID NOs:288-292, respectively) which are all cinnamoyl ester hydrolases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 19A and 19B depict a sequence alignment between SEQ ID NO:304 (RAAC02424), a carboxylesterase type B, and gi|56421584, gi|134105165, gi|124521931, gi|33311865, and gi|138896639 (SEQ ID NOs:305-309, respectively) which are all carboxylesterase type Bs. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 20A-20D depict a sequence alignment between SEQ ID NO:321 (RAAC02616), a beta galactosidase/beta-glucuronidase, and gi|29377189, gi|116493950, gi|40745013, and gi|49176308 (SEQ ID NOs:322-325, respectively) which are all beta galactosidase/beta-glucuronidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 21A-21D depict a sequence alignment between SEQ ID NO:337 (RAAC02661), a xylan alpha-1,2-glucuronidase, and gi|15613624, gi|118725970, gi|148270004, gi|15642830, and gi|116621784 (SEQ ID NOs:338-342, respectively) which are all xylan alpha-1,2-glucuronidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 22A-22C depict a sequence alignment between SEQ ID NO:354 (RAAC02925), a 3-hydroxyisobutyryl-CoA hydrolase, and gi|52080473, gi|17552962, gi|15292329, gi|66851010, and gi|40739053 (SEQ ID NOs:355-359, respectively) which are all 3-hydroxyisobutyryl-CoA hydrolases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 23A-23D depict a sequence alignment between SEQ ID NO:371 (RAAC03001), a beta-glucosidase B-related glycosidase, and gi|125973771, gi|116617985, gi|116494248, gi|116334524, and gi|66851551 (SEQ ID NOs:372-376, respectively) which are all beta-glucosidase B-related glycosidases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 24A and 24B depict a sequence alignment between SEQ ID NO:388 (RAAC02913), a chitooligosaccharide deacetylase, and gi|15614969, gi|124523066, gi|114843671, gi|89101184, and gi|2634042 (SEQ ID NOs:389-393, respectively) which are all chitooligosaccharide deacetylases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 25A and 25B depict a sequence alignment between SEQ ID NO:405 (RAAC02839), a chitooligosaccharide deacetylase, and gi|1595264, gi|20803949, gi|17380381, gi|128438, and gi|1001913 (SEQ ID NOs:406-409, respectively) which are all chitooligosaccharide deacetylases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 26A-26C depict a sequence alignment between SEQ ID NO:422 (RAAC00961), a chitooligosaccharide deacetylase, and gi|124523411, gi|158060979, gi|21219643, gi|13475158, and gi|21219455 (SEQ ID NOs:423-427, respectively) which are all chitooligosaccharide deacetylases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIGS. 27A and 27B depict a sequence alignment between SEQ ID NO:439 (RAAC00361), a chitooligosaccharide deacetylase, and gi|52078651, gi|16077225, gi|89100395, gi|15612806, and gi|121535454 (SEQ ID NOs:440-444, respectively) which are all chitooligosaccharide deacetylases. Amino acids common to three or more of the sequences aligned are indicated in bold.

FIG. 28 is a graphical representation of the relative Alpha-L-arabinofuranosidase activity of RAAC00602 (SEQ ID NO:69) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 29 is a graphical representation of the relative Alpha-L-arabinofuranosidase activity of RAAC00602 (SEQ ID NO:69) produced in P. pastoris. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 30 is a graphical representation of the relative 1,4-β-glucan cellobiohydrolase (CBH) activity of RAAC01917 (SEQ ID NO:270) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 31 is a graphical representation of the relative endo-1,4-β-xylanase (XYL) activity of RAAC01917 (SEQ ID NO:270) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 32 is a graphical representation of the relative α-glucuronidase (AGUR) activity of RAAC02661 (SEQ ID NO:337) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 33 is a graphical representation of the relative β-glucosidase (BGLU) activity of RAAC03001 (SEQ ID NO:371) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 34 is a graphical representation of the relative α-L-arabinofuranosidase (AFS) activity of RAAC03001 (SEQ ID NO:371) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 35 is a graphical representation of the relative β-galactosidase (BGAL) activity of RAAC03001 (SEQ ID NO:371) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 36 is a graphical representation of the relative β-xylosidase (BXYL) activity of RAAC03001 (SEQ ID NO:371) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

FIG. 37 is a graphical representation of the relative 1,4-β-glucan cellobiohydrolase (CBH) activity of RAAC03001 (SEQ ID NO:371) produced in E. coli. Diamonds indicate the activity at 50° C., squares indicate the activity at 60° C., triangles indicate the activity at 70° C., Xs indicate the activity at 80° C., and circles indicate the activity at 90° C.

DETAILED DESCRIPTION OF THE INVENTION

Lignocellulose is a highly heterogeneous three-dimensional matrix comprised primarily of cellulose, hemicellulose, and lignin. Many fuels and chemicals can be made from these lignocellulosic materials. To utilize lignocellulosic biomass for production of fuels and chemicals via fermentative processes, it is necessary to convert the plant polysaccharides to sugar monomers which are then fermented to products using a variety of microorganisms. Direct hydrolysis of lignocellulose by mineral acids to monomers is possible at high temperature and pressure, leading to yield losses due to thermal decomposition of the sugars. Utilizing existing commercially available enzymes, a first strategy to reduce these yield losses is to perform the pretreatment at reduced severity to produce soluble oligomers, followed by the use of cellulases and hemicellulases to depolymerize the polysaccharides at moderate temperatures. In a second approach, the addition of acid stable thermotolerant hydrolytic enzymes including cellulases, xylanases and other hemicellulases to the biomass slurry during the pretreatment allows the use of further reduced temperatures and pressures during the pretreatment, as well as cheaper materials of construction, reducing both the capital and energy costs. An extension of this second approach is to combine the enzyme-assisted reduced severity pretreatment together with fermentation under the same conditions, which further reduces costs.

For commercially available enzymes to be utilized, the first strategy must be used. The second approach represents a significant improvement in the art because the pretreatment and bioconversion of the polysaccharides to products can be achieved in fewer steps/vessels and without intermediately altering the process conditions.

Embodiments of the invention relate in part to the gene sequences and protein sequences encoded by genes of Alicyclobacillus acidocaldarius. Genes included are those necessary to depolymerize biopolymers including lignocellulosic polysaccharides, starches, chitin, polyhydroxybutyrate, and the like, to monomers or oligomers. Intracellular enzyme activities will be thermophilic in nature and general examples of similar genes are described in the literature. Extracellular enzyme activities will be thermoacidophilic (simultaneously thermophilic and acidophilic). The following classes of enzymes are included for polysaccharide depolymerization: glycosyl hydrolases (or glycoside hydrolases), esterases including acetylxylan esterases, p-coumaric acid esterases, and ferulic acid esterases, and uronidases. An additional class of enzymes for biopolymer depolymerization includes polyhydroxybutyrate-degrading enzymes.

The present invention relates to isolated and/or purified nucleotide sequences of the genome of Alicyclobacillus acidocaldarius selected from the sequences SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463 or one of their fragments.

The present invention likewise relates to isolated and/or purified nucleotide sequences, characterized in that they are selected from: a) a nucleotide sequence of a specific fragment of the sequence SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463 or one of their fragments; b) a nucleotide sequence homologous to a nucleotide sequence such as defined in a); c) a nucleotide sequence complementary to a nucleotide sequence such as defined in a) or b), and a nucleotide sequence of their corresponding RNA; d) a nucleotide sequence capable of hybridizing under stringent conditions with a sequence such as defined in a), b) or c); e) a nucleotide sequence comprising a sequence such as defined in a), b), c) or d); and f) a nucleotide sequence modified by a nucleotide sequence such as defined in a), b), c), d) or e).

A “nucleotide, polynucleotide, or nucleic acid sequence” will be understood according to the present invention as meaning both a double-stranded or single-stranded DNA in the monomeric and dimeric (so-called “in tandem”) forms and the transcription products of the DNAs.

Aspects of the invention relate to nucleotide sequences in which it has been possible to isolate, purify or partially purify, starting from separation methods such as, for example, ion-exchange chromatography, by exclusion based on molecular size, or by affinity, or alternatively, fractionation techniques based on solubility in different solvents, or starting from methods of genetic engineering such as amplification, cloning, and subcloning, it being possible for the sequences of the invention to be carried by vectors.

An “isolated and/or purified nucleotide sequence fragment” according to the invention will be understood as designating any nucleotide fragment of the genome of Alicyclobacillus acidocaldarius, and may include, by way of non-limiting example, lengths of at least 8, 12, 20, 25, 50, 75, 100, 200, 300, 400, 500, 1000, or more, consecutive nucleotides of the sequence from which it originates.
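As a non-limiting sketch of the fragment definition above, the following Python code enumerates runs of consecutive nucleotides of a chosen length and tests whether a candidate is a contiguous fragment of a parent sequence. The function names and the default minimum length of 8 (the smallest length listed above) are illustrative assumptions, and any sequence passed in is a toy input rather than a sequence of the Sequence Listing.

```python
def fragments(sequence: str, length: int):
    """Yield every run of `length` consecutive nucleotides in `sequence`."""
    if length < 1:
        raise ValueError("length must be positive")
    for start in range(len(sequence) - length + 1):
        yield sequence[start:start + length]


def is_fragment(candidate: str, sequence: str, min_length: int = 8) -> bool:
    """True if `candidate` has at least `min_length` nucleotides and occurs
    as a run of consecutive nucleotides within `sequence`."""
    return len(candidate) >= min_length and candidate in sequence
```

A parent sequence of n nucleotides yields n - k + 1 fragments of length k, so a 9-nucleotide toy sequence contains two 8-nucleotide fragments.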

A “specific fragment of an isolated and/or purified nucleotide sequence” according to the invention will be understood as designating any nucleotide fragment of the genome of Alicyclobacillus acidocaldarius, having, after alignment and comparison with the corresponding fragments of genomic sequences of Alicyclobacillus acidocaldarius, at least one nucleotide or base of different nature.

A “homologous isolated and/or purified nucleotide sequence” in the sense of the present invention is understood as meaning an isolated and/or purified nucleotide sequence having a percentage identity with the bases of a nucleotide sequence according to the invention of at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, or 99.7%, this percentage being purely statistical and it being possible to distribute the differences between the two nucleotide sequences at random and over the whole of their length.

A “specific homologous nucleotide sequence” in the sense of the present invention is understood as meaning a homologous nucleotide sequence having at least one nucleotide sequence of a specific fragment, such as defined above. The “specific” homologous sequences can comprise, for example, the sequences corresponding to the genomic sequence or to the sequences of its fragments representative of variants of the genome of Alicyclobacillus acidocaldarius. These specific homologous sequences can thus correspond to variations linked to mutations within strains of Alicyclobacillus acidocaldarius, and especially correspond to truncations, substitutions, deletions and/or additions of at least one nucleotide. The homologous sequences can likewise correspond to variations linked to the degeneracy of the genetic code.
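The degeneracy of the genetic code mentioned above can be illustrated with a short Python sketch in which two different nucleotide sequences encode the same peptide. The two 9-nucleotide sequences are invented toy inputs, the codon table is deliberately partial (only the six standard-code codons needed here), and the function name translate is hypothetical.

```python
# Partial standard genetic code: only the codons used in this toy example.
CODON_TABLE = {
    "GCT": "A", "GCA": "A",  # alanine
    "CTG": "L", "CTC": "L",  # leucine
    "AAA": "K", "AAG": "K",  # lysine
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon using CODON_TABLE."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


seq_one = "GCTCTGAAA"  # toy sequence, not from the Sequence Listing
seq_two = "GCACTCAAG"  # differs from seq_one at every third codon position

peptide_one = translate(seq_one)
peptide_two = translate(seq_two)
same_positions = sum(x == y for x, y in zip(seq_one, seq_two))
```

In this sketch the two sequences share 6 of 9 nucleotide positions yet translate to the same tripeptide, which is the kind of variation the passage attributes to the degeneracy of the genetic code.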

The term “degree or percentage of sequence homology” refers to “degree or percentage of sequence identity between two sequences after optimal alignment” as defined in the present application.

Two amino acid or nucleotide sequences are said to be “identical” if the sequence of amino acid or nucleotide residues in the two sequences is the same when aligned for maximum correspondence as described below. Sequence comparisons between two (or more) peptides or polynucleotides are typically performed by comparing sequences of two optimally aligned sequences over a segment or “comparison window” to identify and compare local regions of sequence similarity. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. (U.S.A.) 85:2444 (1988), by computerized implementation of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wis.), or by visual inspection.
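For illustration, a minimal Python sketch of a global alignment in the style of the Needleman and Wunsch algorithm cited above is given below. The scoring values (match +1, mismatch -1, linear gap penalty -1) are assumptions chosen for simplicity; they are not the parameters of GAP, BESTFIT, or any other named package, and the function name is hypothetical.

```python
def needleman_wunsch(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -1):
    """Globally align a and b; return (gapped_a, gapped_b, score)."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # First column and first row: alignment against leading gaps only.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    # Fill the matrix: each cell takes the best of diagonal, up, or left.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[-1][-1]
```

When several alignments share the optimal score, this sketch returns only one of them, since the traceback prefers the diagonal move; production tools typically use substitution matrices and affine gap penalties instead of the simple scores assumed here.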

“Percentage of sequence identity” (or degree of identity) is determined by comparing two optimally aligned sequences over a comparison window, where the portion of the peptide or polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical amino acid residue or nucleic acid base occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
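The calculation described above can be sketched in Python as follows. The sketch assumes the two sequences have already been optimally aligned and padded with "-" for gaps, and that the comparison window is the full length of the alignment; the function name percent_identity and the example strings are illustrative only.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of window positions holding the same residue in both sequences.

    Both inputs must already be aligned (equal length, gaps shown as `gap`).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    # A position counts as matched only if both residues are equal and
    # neither is a gap character.
    matched = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matched / len(aligned_a)
```

For instance, aligning GATTACA against GAT-ACA gives 6 matched positions over a 7-position window, or about 85.7% identity under this definition.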

The definition of sequence identity given above is the definition that would be used by one of skill in the art. The definition by itself does not need the help of any algorithm, the algorithms being helpful only to achieve the optimal alignments of sequences, rather than the calculation of sequence identity.

From the definition given above, it follows that there is one and only one well-defined value for the sequence identity between two compared sequences, which value corresponds to the value obtained for the best or optimal alignment.

In the BLAST N or BLAST P or “BLAST 2 sequence” software, which is available at the website ncbi.nlm.nih.gov/gorf/b12.html and is habitually used by the inventors and, in general, by a skilled person for comparing and determining the identity between two sequences, the gap cost, which depends on the sequence length to be compared, is directly selected by the software (i.e., 11.2 for substitution matrix BLOSUM-62 for length>85).

Complementary nucleotide sequence of a sequence of the invention is understood as meaning any DNA whose nucleotides are complementary to those of the sequence of the invention, and whose orientation is reversed (antisense sequence).
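As a minimal sketch of this definition, the complementary (antisense) sequence of a DNA may be derived by complementing each base and reversing the orientation. The function name is an assumption for the example, and only the four unambiguous bases A, C, G, and T are handled.

```python
# Translation table for Watson-Crick base pairing (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary sequence of dna, read in reversed orientation."""
    return dna.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the starting sequence, which is a convenient check that the complement and the reversal are both being applied.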

Hybridization under conditions of stringency with a nucleotide sequence according to the invention is understood as meaning hybridization under conditions of temperature and ionic strength chosen in such a way that they allow the maintenance of the hybridization between two fragments of complementary DNA.

By way of illustration, conditions of great stringency of the hybridization step with the aim of defining the nucleotide fragments as described above are advantageously obtained by the following.

The hybridization is carried out at a preferential temperature of 65° C. in the presence of SSC buffer, 1×SSC corresponding to 0.15 M NaCl and 0.05 M Na citrate. The washing steps, for example, can be the following: one wash with 2×SSC at ambient temperature, followed by two washes with 2×SSC, 0.5% SDS at 65° C., and then two washes with 0.5×SSC, 0.5% SDS at 65° C., for 10 minutes each.

The conditions of intermediate stringency, using, for example, a temperature of 42° C. in the presence of a 2×SSC buffer, or of less stringency, for example a temperature of 37° C. in the presence of a 2×SSC buffer, respectively, require a globally less significant complementarity for the hybridization between the two sequences.

The stringent hybridization conditions described above for a polynucleotide with a size of approximately 350 bases will be adapted by a person skilled in the art for oligonucleotides of greater or smaller size, according to the teachings of Sambrook et al., 1989.

Among the isolated and/or purified nucleotide sequences according to the invention, are those that can be used as a primer or probe in methods allowing the homologous sequences according to the invention to be obtained, these methods, such as the polymerase chain reaction (PCR), nucleic acid cloning, and sequencing, being well known to a person skilled in the art.

Among the isolated and/or purified nucleotide sequences according to the invention, those are again preferred that can be used as a primer or probe in methods allowing the presence of SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463, one of their fragments, or one of their variants such as defined below to be diagnosed.

The nucleotide sequence fragments according to the invention can be obtained, for example, by specific amplification, such as PCR, or after digestion with appropriate restriction enzymes of nucleotide sequences according to the invention, these methods in particular being described in the work of Sambrook et al., 1989. Such representative fragments can likewise be obtained by chemical synthesis according to methods well known to persons of ordinary skill in the art.

“Modified nucleotide sequence” will be understood as meaning any nucleotide sequence obtained by mutagenesis according to techniques well known to a person skilled in the art, and containing modifications with respect to the normal sequences according to the invention, for example, mutations in the regulatory and/or promoter sequences of polypeptide expression, especially leading to a modification of the rate of expression of the polypeptide or to a modulation of the replicative cycle.

“Modified nucleotide sequence” will likewise be understood as meaning any nucleotide sequence coding for a modified polypeptide such as defined below.

The present invention relates to isolated and/or purified nucleotide sequences of Alicyclobacillus acidocaldarius, characterized in that they are selected from the sequences of SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463 or one of their fragments.

Embodiments of the invention likewise relate to isolated and/or purified nucleotide sequences characterized in that they comprise a nucleotide sequence selected from: a) nucleotide sequences of SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463 or one of their fragments; b) a nucleotide sequence of a specific fragment of a sequence such as defined in a); c) a homologous nucleotide sequence having at least 80% identity with a sequence such as defined in a) or b); d) a complementary nucleotide sequence or sequence of RNA corresponding to a sequence such as defined in a), b) or c); and e) a nucleotide sequence modified by a sequence such as defined in a), b), c) or d).

Among the isolated and/or purified nucleotide sequences according to the invention are the nucleotide sequences of SEQ ID NOs:8-12, 25-29, 41-45, 58-62, 75-79, 91-95, 108-112, 125-129, 142-146, 157-161, 174-178, 191-195, 208-212, 225-229, 242-246, 259-263, 276-280, 293-297, 310-314, 326-330, 343-347, 360-364, 377-381, 394-398, 411-415, 428-432, or 445-449 or fragments thereof and any other isolated and/or purified nucleotide sequences which have a homology of at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, or 99.7% identity with the sequence SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463 or fragments thereof. The homologous sequences can comprise, for example, the sequences corresponding to the genomic sequences of Alicyclobacillus acidocaldarius. In the same manner, these specific homologous sequences can correspond to variations linked to mutations within strains of Alicyclobacillus acidocaldarius and especially correspond to truncations, substitutions, deletions and/or additions of at least one nucleotide.

Embodiments of the invention comprise the isolated and/or purified polypeptides encoded by a nucleotide sequence according to the invention, or fragments thereof, whose sequence is represented by a fragment. Amino acid sequences corresponding to the isolated and/or purified polypeptides can be encoded according to one of the three possible reading frames of the sequence SEQ ID NOs:1, 18, 35, 51, 68, 85, 101, 118, 135, 152, 167, 184, 201, 218, 235, 252, 269, 286, 303, 320, 336, 353, 370, 387, 404, 421, 438, 455, 457, 459, 461, or 463.
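The three possible reading frames of a nucleotide sequence on a given strand can be illustrated by splitting the sequence into codons starting at offsets 0, 1, and 2. The following is a sketch only; the function name is an assumption, and trailing bases that do not complete a codon are simply dropped.

```python
def reading_frames(seq):
    """Return the codons of the three forward reading frames of seq."""
    frames = []
    for offset in range(3):
        # Last index that still allows a complete three-base codon.
        usable = len(seq) - (len(seq) - offset) % 3
        frames.append([seq[i:i + 3] for i in range(offset, usable, 3)])
    return frames
```

Each inner list can then be translated codon by codon to give the amino acid sequence encoded in that frame.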

Embodiments of the invention likewise relate to the isolated and/or purified polypeptides, characterized in that they comprise a polypeptide selected from the amino acid sequences of SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, 439, 456, 458, 460, 462, or 464 or one of their fragments.

Among the isolated and/or purified polypeptides, according to embodiments of the invention, are the isolated and/or purified polypeptides of amino acid sequence SEQ ID NOs:13-17, 30-34, 46-50, 63-67, 80-84, 96-100, 113-117, 130-134, 147-151, 162-166, 179-183, 196-200, 213-217, 230-234, 247-251, 264-268, 281-285, 298-302, 315-319, 331-335, 348-352, 365-369, 382-386, 399-403, 416-420, 433-437, or 450-454 or fragments thereof or any other isolated and/or purified polypeptides which have a homology of at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.6%, or 99.7% identity with the sequence SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, 439, 456, 458, 460, 462, or 464 or fragments thereof.

Embodiments of the invention also relate to the polypeptides, characterized in that they comprise a polypeptide selected from: a) a specific fragment of at least five amino acids of a polypeptide of an amino acid sequence according to the invention; b) a polypeptide homologous to a polypeptide such as defined in a); c) a specific biologically active fragment of a polypeptide such as defined in a) or b); and d) a polypeptide modified by a polypeptide such as defined in a), b) or c).

In the present description, the terms polypeptide, peptide and protein are interchangeable.

In embodiments of the invention, the isolated and/or purified polypeptides according to the invention may be glycosylated, pegylated, and/or otherwise post-translationally modified. In further embodiments, glycosylation, pegylation, and/or other post-translational modifications may occur in vivo or in vitro and/or may be performed using chemical techniques. In additional embodiments, any glycosylation, pegylation and/or other post-translational modifications may be N-linked or O-linked.

In embodiments of the invention, any one of the isolated and/or purified polypeptides according to the invention may be enzymatically active at temperatures at or above about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, and/or 95 degrees Celsius and/or may be enzymatically active at a pH at, below, and/or above 7, 6, 5, 4, 3, 2, 1, and/or 0. In further embodiments of the invention, glycosylation, pegylation, and/or other post-translational modification may be required for the isolated and/or purified polypeptides according to the invention to be enzymatically active at a pH at or below 7, 6, 5, 4, 3, 2, 1, and/or 0 or at temperatures at or above about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, and/or 95 degrees Celsius.

Aspects of the invention relate to polypeptides that are isolated or obtained by purification from natural sources, or else obtained by genetic recombination, or alternatively, by chemical synthesis and, thus, they may contain unnatural amino acids, as will be described below.

A “polypeptide fragment” according to the embodiments of the invention is understood as designating a polypeptide containing at least five consecutive amino acids, preferably ten consecutive amino acids or fifteen consecutive amino acids.
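By way of a simple sketch, the fragments of a given length made up of consecutive amino acids can be enumerated with a sliding window. The function name is an assumption for the example; the minimum of five residues follows the definition above.

```python
def consecutive_fragments(polypeptide, length=5):
    """List every fragment of `length` consecutive amino acids (length >= 5)."""
    if length < 5:
        raise ValueError("a polypeptide fragment contains at least five amino acids")
    # Slide a window of the requested length along the one-letter sequence.
    return [
        polypeptide[i:i + length]
        for i in range(len(polypeptide) - length + 1)
    ]
```

A sequence shorter than the requested length yields an empty list.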

In the present invention, a specific polypeptide fragment is understood as designating the consecutive polypeptide fragment encoded by a specific fragment nucleotide sequence according to the invention.

“Homologous polypeptide” will be understood as designating the polypeptides having, with respect to the natural polypeptide, certain modifications such as, in particular, a deletion, addition, or substitution of at least one amino acid, a truncation, a prolongation, a chimeric fusion, and/or a mutation. Among the homologous polypeptides, those are preferred whose amino acid sequence has at least 80% or 90% homology with the sequences of amino acids of polypeptides according to the invention.

“Specific homologous polypeptide” will be understood as designating the homologous polypeptides, such as defined above, and having a specific fragment of polypeptide according to the invention.

In the case of a substitution, one or more consecutive or nonconsecutive amino acids are replaced by “equivalent” amino acids. The expression “equivalent” amino acid is directed here as designating any amino acid capable of being substituted by one of the amino acids of the base structure without, however, essentially modifying the biological activities of the corresponding peptides, such that they will be defined by the following. Examples of such substitutions in the amino acid sequences of SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, 439, 456, 458, 460, 462, or 464 may include those isolated and/or purified polypeptides of amino acid sequence SEQ ID NOs:13-17, 30-34, 46-50, 63-67, 80-84, 96-100, 113-117, 130-134, 147-151, 162-166, 179-183, 196-200, 213-217, 230-234, 247-251, 264-268, 281-285, 298-302, 315-319, 331-335, 348-352, 365-369, 382-386, 399-403, 416-420, 433-437, or 450-454.

These equivalent amino acids can be determined either by depending on their structural homology with the amino acids which they substitute, or on results of comparative tests of biological activity between the different polypeptides, which are capable of being carried out.

By way of non-limiting example, the possibilities of substitutions capable of being carried out without resulting in an extensive modification of the biological activity of the corresponding modified polypeptides will now be mentioned, the replacement, for example, of leucine by valine or isoleucine, of aspartic acid by glutamic acid, of glutamine by asparagine, of arginine by lysine, etc., the reverse substitutions naturally being envisageable under the same conditions.
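The substitutions listed above can be captured in a small lookup, shown here purely as an illustration. The table contains only the pairs named in the preceding paragraph (in one-letter code) together with their reverse substitutions; the names are assumptions for the example, and the table is not an exhaustive list of equivalent amino acids.

```python
# Pairs named in the text: Leu->Val, Leu->Ile, Asp->Glu, Gln->Asn, Arg->Lys.
_NAMED_PAIRS = [("L", "V"), ("L", "I"), ("D", "E"), ("Q", "N"), ("R", "K")]

# Reverse substitutions are allowed under the same conditions.
_EQUIVALENT = {frozenset(pair) for pair in _NAMED_PAIRS}


def is_equivalent_substitution(original, replacement):
    """True if replacing `original` by `replacement` is one of the named pairs."""
    return frozenset((original, replacement)) in _EQUIVALENT


def non_equivalent_changes(reference, variant):
    """Positions (0-based) where variant differs by a substitution not in the table."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        i
        for i, (x, y) in enumerate(zip(reference, variant))
        if x != y and not is_equivalent_substitution(x, y)
    ]
```

Whether a given substitution actually preserves biological activity would still be established by the comparative tests described above; the lookup only flags candidates.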

In a further embodiment, substitutions are limited to substitutions in amino acids not conserved among other proteins which have similar identified enzymatic activity. For example, the figures herein provide sequence alignments between certain polypeptides of the invention and other polypeptides identified as having similar enzymatic activity, with amino acids common to three or more of the sequences aligned indicated in bold. Thus, according to one embodiment of the invention, substitutions or mutations may be made at positions that are not indicated as in bold in the figures. Examples of such polypeptides may include, but are not limited to, those found in the amino acid sequences of SEQ ID NOs:13-17, 30-34, 46-50, 63-67, 80-84, 96-100, 113-117, 130-134, 147-151, 162-166, 179-183, 196-200, 213-217, 230-234, 247-251, 264-268, 281-285, 298-302, 315-319, 331-335, 348-352, 365-369, 382-386, 399-403, 416-420, 433-437, or 450-454. In a further embodiment, nucleic acid sequences may be mutated or substituted such that the amino acid they encode is unchanged (degenerate substitutions and/or mutations) and/or mutated or substituted such that any resulting amino acid substitutions or mutations are made at positions that are not indicated as in bold in the figures. Examples of such nucleic acid sequences may include, but are not limited to, those found in the nucleotide sequences of SEQ ID NOs:13-17, 30-34, 46-50, 63-67, 80-84, 96-100, 113-117, 130-134, 147-151, 162-166, 179-183, 196-200, 213-217, 230-234, 247-251, 264-268, 281-285, 298-302, 315-319, 331-335, 348-352, 365-369, 382-386, 399-403, 416-420, 433-437, or 450-454 or fragments thereof.
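A degenerate substitution, in which the encoded amino acid is unchanged, can be checked with the standard genetic code, as in the following sketch. The function names are assumptions for the example, and the standard code is assumed; an organism-specific translation table could be substituted.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; "*" marks stop.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def is_degenerate_substitution(codon_a, codon_b):
    """True if two different DNA codons encode the same amino acid."""
    a, b = codon_a.upper(), codon_b.upper()
    return a != b and CODON_TABLE[a] == CODON_TABLE[b]
```

For instance, CTT and CTC both encode leucine, so exchanging one for the other leaves the polypeptide unchanged, whereas ATG (methionine) and ATA (isoleucine) do not.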

The specific homologous polypeptides likewise correspond to polypeptides encoded by the specific homologous nucleotide sequences such as defined above, and thus comprise in the present definition the polypeptides that are mutated or correspond to variants that can exist in Alicyclobacillus acidocaldarius, and which especially correspond to truncations, substitutions, deletions, and/or additions of at least one amino acid residue.

“Specific biologically active fragment of a polypeptide” according to an embodiment of the invention will be understood in particular as designating a specific polypeptide fragment, such as defined above, as having at least one of the characteristics of polypeptides according to the invention. In certain embodiments, the peptide is capable of acting as an Alpha beta hydrolase, Alpha-glucosidase, Glucan 1,4-alpha-maltohydrolase, Glycosidase, Amylase, Acetyl esterase, Beta-galactosidase, Alpha amylase, Alpha-xylosidase, Cyclomaltodextrinase; Neopullulanase; Maltogenic alpha-amylase, Family 31 of glycosyl hydrolase, Alpha-L-arabinofuranosidase, Cell wall hydrolase, Altronate hydrolase, poly-1,4-alpha-D-galacturonide, Xylan alpha-1,2-glucuronosidase, Cellulase/Endoglucanase M, Polygalacturonase, Glycosyl hydrolase, Peptidoglycan hydrolase, N-acetylglucosaminidase, Endochitinase, Alpha-galactosidase, Endo-beta-1,4-mannanase, Cellobiose phosphorylase, Cyclic beta-1,2-glucan synthase, Glycogen debranching enzyme, Acetyl hydrolase, Beta-1,4-xylanase, Beta-glucosidase, 6-phospho-beta-glucosidase, Cinnamoyl ester hydrolase, Beta-glucuronidase, 3-hydroxyisobutyryl-CoA hydrolase, Beta-glucosidase B-related glycosidase, and/or Chitooligosaccharide deacetylase.

The polypeptide fragments according to embodiments of the invention can correspond to isolated or purified fragments naturally present in an Alicyclobacillus acidocaldarius or correspond to fragments that can be obtained by cleavage of the polypeptide by a proteolytic enzyme, such as trypsin or chymotrypsin or collagenase, or by a chemical reagent, such as cyanogen bromide (CNBr). Such polypeptide fragments can likewise just as easily be prepared by chemical synthesis, or from hosts transformed by an expression vector according to the invention containing a nucleic acid allowing the expression of the fragments and placed under the control of appropriate regulation and/or expression elements.
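The fragments expected from such a cleavage can be predicted with a simple in silico digestion, sketched below. The sketch assumes the commonly used rules that trypsin cleaves on the carboxyl side of lysine or arginine except when the next residue is proline, and that cyanogen bromide cleaves on the carboxyl side of methionine. Function names are assumptions for the example, and missed cleavages and other enzymes are not modeled.

```python
def cleave(seq, residues, blocked_by=""):
    """Cut seq after each residue in `residues`, unless the next residue is blocked."""
    fragments, start = [], 0
    for i, aa in enumerate(seq):
        is_last = i == len(seq) - 1
        if aa in residues and not is_last and seq[i + 1] not in blocked_by:
            fragments.append(seq[start:i + 1])
            start = i + 1
    # Whatever remains after the final cut is the C-terminal fragment.
    fragments.append(seq[start:])
    return fragments


def trypsin_digest(seq):
    """Cleave after K or R, but not when followed by P (assumed rule)."""
    return cleave(seq, "KR", blocked_by="P")


def cnbr_cleave(seq):
    """Cleave after M (assumed rule for cyanogen bromide)."""
    return cleave(seq, "M")
```

The resulting fragments can then be compared with the fragment definition given above, for example by keeping only those containing at least five consecutive amino acids.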

“Modified polypeptide” of a polypeptide according to an embodiment of the invention is understood as designating a polypeptide obtained by genetic recombination or by chemical synthesis, as will be described below, as having at least one modification with respect to the normal sequence. These modifications may or may not be able to bear on amino acids at the origin of specificity, and/or of activity, or at the origin of the structural conformation, localization, and of the capacity of membrane insertion of the polypeptide according to the invention. It will thus be possible to create polypeptides of equivalent, increased, or decreased activity, and of equivalent, narrower, or wider specificity. Among the modified polypeptides, it is necessary to mention the polypeptides in which up to five amino acids can be modified, truncated at the N- or C-terminal end, or even deleted or added.

The methods allowing modulations on eukaryotic or prokaryotic cells to be demonstrated are well known to the person of ordinary skill in the art. It is likewise well understood that it will be possible to use the nucleotide sequences coding for the modified polypeptides for the modulations, for example, through vectors according to the invention and described below.

The preceding modified polypeptides can be obtained by using combinatorial chemistry, in which it is possible to systematically vary parts of the polypeptide before testing them on models, cell cultures or microorganisms, for example, to select the compounds that are most active or have the properties sought.

Chemical synthesis likewise has the advantage of being able to use unnatural amino acids, or nonpeptide bonds.

Thus, in order to improve the duration of the life of the polypeptides according to the invention, it may be of interest to use unnatural amino acids, e.g., in D form, or else amino acid analogs, especially sulfur-containing forms, for example.

Finally, it will be possible to integrate the structure of the polypeptides according to the invention, its specific or modified homologous forms, into chemical structures of polypeptide types or others. Thus, it may be of interest to provide at the N- and C-terminal ends compounds not recognized by proteases.

The nucleotide sequences coding for a polypeptide according to the invention are likewise part of the invention.

The invention likewise relates to nucleotide sequences utilizable as a primer or probe, characterized in that the sequences are selected from the nucleotide sequences according to the invention.

It is well understood that the present invention, in various embodiments, likewise relates to specific polypeptides of Alicyclobacillus acidocaldarius, encoded by nucleotide sequences, capable of being obtained by purification from natural polypeptides, by genetic recombination or by chemical synthesis by procedures well known to a person skilled in the art and such as described in particular below. In the same manner, the labeled or unlabeled mono- or polyclonal antibodies directed against the specific polypeptides and encoded by the nucleotide sequences are also encompassed by the invention.

Embodiments of the invention additionally relate to the use of a nucleotide sequence according to the invention as a primer or probe for the detection and/or the amplification of nucleic acid sequences.

The nucleotide sequences according to embodiments of the invention can thus be used to amplify nucleotide sequences, especially by the PCR technique (polymerase chain reaction) (Erlich, 1989; Innis et al., 1990; Rolfs et al., 1991; and White et al., 1997).

These oligodeoxyribonucleotide or oligoribonucleotide primers advantageously have a length of at least eight nucleotides, preferably of at least twelve nucleotides, and even more preferentially at least twenty nucleotides.

Other amplification techniques of the target nucleic acid can be advantageously employed as alternatives to PCR.

The nucleotide sequences of the invention, in particular the primers according to the invention, can likewise be employed in other procedures of amplification of a target nucleic acid, such as: the TAS technique (Transcription-based Amplification System), described by Kwoh et al. in 1989; the 3SR technique (Self-Sustained Sequence Replication), described by Guatelli et al. in 1990; the NASBA technique (Nucleic Acid Sequence Based Amplification), described by Kievits et al. in 1991; the SDA technique (Strand Displacement Amplification) (Walker et al., 1992); and the TMA technique (Transcription Mediated Amplification).

The polynucleotides of the invention can also be employed in techniques of amplification or of modification of the nucleic acid serving as a probe, such as: the LCR technique (Ligase Chain Reaction), described by Landegren et al. in 1988 and improved by Barany et al. in 1991, which employs a thermostable ligase; the RCR technique (Repair Chain Reaction), described by Segev in 1992; the CPR technique (Cycling Probe Reaction), described by Duck et al. in 1990; the amplification technique with Q-beta replicase, described by Miele et al. in 1983 and especially improved by Chu et al. in 1986, Lizardi et al. in 1988, then by Burg et al., as well as by Stone et al. in 1996.

In the case where the target polynucleotide to be detected is possibly an RNA, for example, an mRNA, it will be possible to use, prior to the employment of an amplification reaction with the aid of at least one primer according to the invention or to the employment of a detection procedure with the aid of at least one probe of the invention, an enzyme of reverse transcriptase type in order to obtain a cDNA from the RNA contained in the biological sample. The cDNA obtained will thus serve as a target for the primer(s) or the probe(s) employed in the amplification or detection procedure according to the invention.

The detection probe will be chosen in such a manner that it hybridizes with the target sequence or the amplicon generated from the target sequence. By way of example, such a probe will advantageously have a sequence of at least twelve nucleotides, in particular of at least twenty nucleotides, and preferably of at least 100 nucleotides.

Embodiments of the invention also comprise the nucleotide sequences utilizable as a probe or primer according to the invention, characterized in that they are labeled with a radioactive compound or with a nonradioactive compound.

The unlabeled nucleotide sequences can be used directly as probes or primers, although the sequences are generally labeled with a radioactive element (³²P, ³⁵S, ³H, ¹²⁵I) or with a nonradioactive molecule (biotin, acetylaminofluorene, digoxigenin, 5-bromodeoxyuridine, fluorescein) to obtain probes that are utilizable for numerous applications.

Examples of nonradioactive labeling of nucleotide sequences are described, for example, in French Patent No. 7810975 or by Urdea et al. or by Sanchez-Pescador et al. in 1988.

In the latter case, it will also be possible to use one of the labeling methods described in patents FR-2 422 956 and FR-2 518 755.

The hybridization technique can be carried out in various manners (Matthews et al., 1988). The most general method consists in immobilizing the nucleic acid extract of cells on a support (such as nitrocellulose, nylon, polystyrene) and in incubating, under well-defined conditions, the immobilized target nucleic acid with the probe. After hybridization, the excess of probe is eliminated and the hybrid molecules formed are detected by the appropriate method (measurement of the radioactivity, of the fluorescence or of the enzymatic activity linked to the probe).

The invention, in various embodiments, likewise comprises the nucleotide sequences according to the invention, characterized in that they are immobilized on a support, covalently or noncovalently.

According to another advantageous mode of employing nucleotide sequences according to the invention, the latter can be used by being immobilized on a support and can thus serve to capture, by specific hybridization, the target nucleic acid obtained from the biological sample to be tested. If necessary, the solid support is separated from the sample, and the hybridization complex formed between the capture probe and the target nucleic acid is then detected with the aid of a second probe, a so-called “detection probe,” labeled with an easily detectable element.

Another aspect of the present invention is a vector for the cloning and/or expression of a sequence, characterized in that it contains a nucleotide sequence according to the invention.

The vectors according to the invention, characterized in that they contain the elements allowing the expression and/or the secretion of the nucleotide sequences in a determined host cell, are likewise part of the invention.

The vector may then contain a promoter, signals of initiation and termination of translation, as well as appropriate regions of regulation of transcription. It may be able to be maintained stably in the host cell and can optionally have particular signals specifying the secretion of the translated protein. These different elements may be chosen as a function of the host cell used. To this end, the nucleotide sequences according to the invention may be inserted into autonomous replication vectors within the chosen host, or integrated vectors of the chosen host.

Such vectors will be prepared according to the methods currently used by a person skilled in the art, and it will be possible to introduce the clones resulting therefrom into an appropriate host by standard methods, such as, for example, lipofection, electroporation, and thermal shock.

The vectors, according to the invention, are, for example, vectors of plasmid or viral origin. One example of a vector for the expression of polypeptides of the invention is baculovirus.

These vectors are useful for transforming host cells in order to clone or to express the nucleotide sequences of the invention.

The invention likewise comprises the host cells transformed by a vector according to the invention.

These cells can be obtained by the introduction into host cells of a nucleotide sequence inserted into a vector, such as defined above, and then the culturing of the cells under conditions allowing the replication and/or expression of the transfected nucleotide sequence.

The host cell can be selected from prokaryotic or eukaryotic systems, such as, for example, bacterial cells (Olins and Lee, 1993), but likewise yeast cells (Buckholz, 1993), as well as plant cells, such as Arabidopsis sp., and animal cells, in particular the cultures of mammalian cells (Edwards and Aruffo, 1993), for example, Chinese hamster ovary (CHO) cells, but likewise the cells of insects in which it is possible to use procedures employing baculoviruses, for example, Sf9 insect cells (Luckow, 1993).

Embodiments of the invention likewise relate to organisms comprising one of the transformed cells according to the invention.

The obtainment of transgenic organisms according to the invention overexpressing one or more of the genes of Alicyclobacillus acidocaldarius or part of the genes may be carried out in, for example, rats, mice, or rabbits according to methods well known to a person skilled in the art, such as by viral or nonviral transfections. It will be possible to obtain the transgenic organisms overexpressing one or more of the genes by transfection of multiple copies of the genes under the control of a strong promoter of ubiquitous nature, or selective for one type of tissue. It will likewise be possible to obtain the transgenic organisms by homologous recombination in embryonic cell strains, transfer of these cell strains to embryos, selection of the affected chimeras at the level of the reproductive lines, and growth of the chimeras.

The transformed cells, as well as the transgenic organisms according to the invention, are utilizable in procedures for preparation of recombinant polypeptides.

It is today possible to produce recombinant polypeptides in a relatively large quantity by genetic engineering, for example, using the cells transformed by expression vectors according to the invention or using transgenic organisms according to the invention.

The procedures for preparation of a polypeptide of the invention in recombinant form, characterized in that they employ a vector and/or a cell transformed by a vector according to the invention and/or a transgenic organism comprising one of the transformed cells according to the invention are themselves comprised in the present invention.

As used herein, “transformation” and “transformed” relate to the introduction of nucleic acids into a cell, whether prokaryotic or eukaryotic. Further, “transformation” and “transformed,” as used herein, need not relate to growth control or growth deregulation.

Among the procedures for preparation of a polypeptide of the invention in recombinant form, the preparation procedures include employing a vector, and/or a cell transformed by the vector and/or a transgenic organism comprising one of the transformed cells, containing a nucleotide sequence according to the invention coding for a polypeptide of Alicyclobacillus acidocaldarius.

A variant according to the invention may consist of producing a recombinant polypeptide fused to a “carrier” protein (chimeric protein). The advantage of this system is that it may allow stabilization of and/or a decrease in the proteolysis of the recombinant product, an increase in the solubility in the course of renaturation in vitro and/or a simplification of the purification when the fusion partner has an affinity for a specific ligand.

More particularly, the invention relates to a procedure for preparation of a polypeptide of the invention comprising the following steps: a) culture of transformed cells under conditions allowing the expression of a recombinant polypeptide of nucleotide sequence according to the invention; and b) if need be, recovery of the recombinant polypeptide.

When the procedure for preparation of a polypeptide of the invention employs a transgenic organism according to the invention, the recombinant polypeptide is then extracted from the organism.

The invention also relates to a polypeptide, which is capable of being obtained by a procedure of the invention, such as described previously.

The invention also comprises a procedure for preparation of a synthetic polypeptide, characterized in that it uses a sequence of amino acids of polypeptides according to the invention.

The invention likewise relates to a synthetic polypeptide obtained by a procedure according to the invention.

The polypeptides according to the invention can likewise be prepared by techniques, which are conventional in the field of the synthesis of peptides. This synthesis can be carried out in homogeneous solution or in solid phase.

For example, recourse can be made to the technique of synthesis in homogeneous solution described by Houben-Weyl in 1974.

This method of synthesis consists in successively condensing, two by two, the successive amino acids in the order required, or in condensing amino acids and fragments formed previously and already containing several amino acids in the appropriate order, or alternatively, several fragments previously prepared in this way, it being understood that it will be necessary to protect beforehand all the reactive functions carried by these amino acids or fragments, with the exception of amine functions of one and carboxyls of the other or vice-versa, which must normally be involved in the formation of peptide bonds, especially after activation of the carboxyl function, according to the methods well known in the synthesis of peptides.

Recourse may also be made to the technique described by Merrifield in 1966.

To make a peptide chain according to the Merrifield procedure, recourse is made to a very porous polymeric resin, on which is immobilized the first C-terminal amino acid of the chain. This amino acid is immobilized on a resin through its carboxyl group and its amine function is protected. The amino acids that are going to form the peptide chain are thus immobilized, one after the other, on the amino group, which is deprotected beforehand each time, of the portion of the peptide chain already formed, and which is attached to the resin. When the whole of the desired peptide chain has been formed, the protective groups of the different amino acids forming the peptide chain are eliminated and the peptide is detached from the resin with the aid of an acid.
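The order of operations in the solid-phase procedure described above can be sketched in code. The following is a minimal illustration only: the function name, the step wording, and the example tripeptide are hypothetical, and the sketch assumes the target sequence is written from N-terminus to C-terminus in one-letter code.

```python
def merrifield_steps(sequence):
    """List the solid-phase steps for `sequence` (written N- to C-terminus).

    Hypothetical helper for illustration; it models only the order of
    operations, not any chemistry.
    """
    if not sequence:
        raise ValueError("sequence must contain at least one residue")
    residues = list(sequence)
    # The C-terminal residue is immobilized first, through its carboxyl group.
    steps = [f"anchor {residues[-1]} to resin via carboxyl group (amine protected)"]
    # The remaining residues are added one at a time toward the N-terminus.
    for residue in reversed(residues[:-1]):
        steps.append("deprotect amino group of resin-bound chain")
        steps.append(f"couple {residue}")
    # Final step: remove protective groups and release the peptide with acid.
    steps.append("remove protective groups and cleave peptide from resin with acid")
    return steps


steps = merrifield_steps("GAV")
for step in steps:
    print(step)
```

For the hypothetical tripeptide "GAV", the sketch anchors V first and then couples A and G in that order, reflecting that the chain grows from the C-terminus toward the N-terminus.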

The invention additionally relates to hybrid polypeptides having at least one polypeptide according to the invention, and a sequence of a polypeptide capable of inducing an immune response in man or animals.

Advantageously, the antigenic determinant is such that it is capable of inducing a humoral and/or cellular response.

It will be possible for such a determinant to comprise a polypeptide according to the invention in a glycosylated, pegylated, and/or otherwise post-translationally modified form used with a view to obtaining immunogenic compositions capable of inducing the synthesis of antibodies directed against multiple epitopes.

These hybrid molecules can be formed, in part, of a polypeptide carrier molecule or of fragments thereof according to the invention, associated with a possibly immunogenic part, in particular an epitope of the diphtheria toxin, the tetanus toxin, a surface antigen of the hepatitis B virus (patent FR 79 21811), the VP1 antigen of the poliomyelitis virus or any other viral or bacterial toxin or antigen.

The procedures for synthesis of hybrid molecules encompass the methods used in genetic engineering for constructing hybrid nucleotide sequences coding for the polypeptide sequences sought. It will be possible, for example, to refer advantageously to the technique for obtainment of gene coding for fusion proteins described by Minton in 1984.

The hybrid nucleotide sequences coding for a hybrid polypeptide, as well as the hybrid polypeptides according to the invention characterized in that they are recombinant polypeptides obtained by the expression of the hybrid nucleotide sequences, are likewise part of the invention.

The invention likewise comprises the vectors characterized in that they contain one of the hybrid nucleotide sequences. The host cells transformed by the vectors, the transgenic organisms comprising one of the transformed cells as well as the procedures for preparation of recombinant polypeptides using the vectors, the transformed cells and/or the transgenic organisms are, of course, likewise part of the invention.

The polypeptides according to the invention, the antibodies according to the invention, described below, and the nucleotide sequences according to the invention can advantageously be employed in procedures for the detection and/or identification of Alicyclobacillus acidocaldarius, in a sample capable of containing them. These procedures, according to the specificity of the polypeptides, the antibodies and the nucleotide sequences, according to the invention, which will be used, will in particular be able to detect and/or to identify an Alicyclobacillus acidocaldarius.

The polypeptides according to the invention can advantageously be employed in a procedure for the detection and/or the identification of Alicyclobacillus acidocaldarius in a sample capable of containing them, characterized in that it comprises the following steps: a) contacting of this sample with a polypeptide or one of its fragments according to the invention (under conditions allowing an immunological reaction between the polypeptide and the antibodies possibly present in the biological sample); and b) demonstration of the antigen-antibody complexes possibly formed.

Any conventional procedure can be employed for carrying out such a detection of the antigen-antibody complexes possibly formed.

By way of example, a preferred method brings into play immunoenzymatic processes according to the ELISA technique, by immunofluorescence, or radioimmunological assay processes (RIA), or their equivalent.

Thus, the invention likewise relates to the polypeptides according to the invention, labeled with the aid of an adequate label such as of the enzymatic, fluorescent or radioactive type.

Such methods comprise, for example, the following steps: a) deposition of determined quantities of a polypeptide composition according to the invention in the wells of a microtiter plate; b) introduction into the wells of increasing dilutions of serum, or of a biological sample other than that defined previously, to be analyzed; c) incubation of the microplate; d) introduction into the wells of the microtiter plate of labeled antibodies directed against pig immunoglobulins, the labeling of these antibodies having been carried out with the aid of an enzyme selected from those that are capable of hydrolyzing a substrate by modifying the absorption of the radiation of the latter, at least at a determined wavelength, for example, at 550 nm; and e) detection, by comparison with a control test, of the quantity of hydrolyzed substrate.

The polypeptides according to the invention allow monoclonal or polyclonal antibodies to be prepared, which are characterized in that they specifically recognize the polypeptides according to the invention. It will advantageously be possible to prepare the monoclonal antibodies from hybridomas according to the technique described by Kohler and Milstein in 1975. It will be possible to prepare the polyclonal antibodies, for example, by immunization of an animal, in particular a mouse, with a polypeptide or a DNA, according to the invention, associated with an adjuvant of the immune response, and then purification of the specific antibodies contained in the serum of the immunized animals on an affinity column on which the polypeptide that has served as an antigen has previously been immobilized. The polyclonal antibodies according to the invention can also be prepared by purification, on an affinity column on which a polypeptide according to the invention has previously been immobilized, of the antibodies contained in the serum of an animal immunologically challenged by Alicyclobacillus acidocaldarius, or a polypeptide or fragment according to the invention.

The invention likewise relates to mono- or polyclonal antibodies or their fragments, or chimeric antibodies, characterized in that they are capable of specifically recognizing a polypeptide according to the invention.

It will likewise be possible for the antibodies of the invention to be labeled in the same manner as described previously for the nucleic probes of the invention, such as a labeling of enzymatic, fluorescent or radioactive type.

The invention is additionally directed at a procedure for the detection and/or identification of Alicyclobacillus acidocaldarius in a sample, characterized in that it comprises the following steps: a) contacting of the sample with a mono- or polyclonal antibody according to the invention (under conditions allowing an immunological reaction between the antibodies and the polypeptides of Alicyclobacillus acidocaldarius possibly present in the biological sample); and b) demonstration of the antigen-antibody complex possibly formed.

The present invention likewise relates to a procedure for the detection and/or the identification of Alicyclobacillus acidocaldarius in a sample, characterized in that it employs a nucleotide sequence according to the invention.

More particularly, the invention relates to a procedure for the detection and/or the identification of Alicyclobacillus acidocaldarius in a sample, characterized in that it contains the following steps: a) if need be, isolation of the DNA from the sample to be analyzed; b) specific amplification of the DNA of the sample with the aid of at least one primer, or a pair of primers, according to the invention; and c) demonstration of the amplification products.

These can be detected, for example, by the technique of molecular hybridization utilizing a nucleic probe according to the invention. This probe will advantageously be labeled with a nonradioactive (cold probe) or radioactive element.

For the purposes of the present invention, “DNA of the biological sample” or “DNA contained in the biological sample” will be understood as meaning either the DNA present in the biological sample considered, or possibly the cDNA obtained after the action of an enzyme of reverse transcriptase type on the RNA present in the biological sample.

A further embodiment of the invention comprises a method, characterized in that it comprises the following steps: a) contacting of a nucleotide probe according to the invention with a biological sample, the DNA contained in the biological sample having, if need be, previously been made accessible to hybridization under conditions allowing the hybridization of the probe with the DNA of the sample; and b) demonstration of the hybrid formed between the nucleotide probe and the DNA of the biological sample.

The present invention also relates to a procedure according to the invention, characterized in that it comprises the following steps: a) contacting of a nucleotide probe immobilized on a support according to the invention with a biological sample, the DNA of the sample having, if need be, previously been made accessible to hybridization, under conditions allowing the hybridization of the probe with the DNA of the sample; b) contacting of the hybrid formed between the nucleotide probe immobilized on a support and the DNA contained in the biological sample, if need be after elimination of the DNA of the biological sample that has not hybridized with the probe, with a nucleotide probe labeled according to the invention; and c) demonstration of the novel hybrid formed in step b).

According to an advantageous embodiment of the procedure for detection and/or identification defined previously, this is characterized in that, prior to step a), the DNA of the biological sample is first amplified with the aid of at least one primer according to the invention.

Further embodiments of the invention comprise methods of at least partially degrading, cleaving, and/or removing a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylan, glycoside, xylan-, glucan-, galactan-, and/or mannan-decorating group. Degrading, cleaving, and/or removing these structures has art-recognized utility, such as that described in Mielenz 2001; Jeffries 1996; Shallom and Shoham 2003; Lynd et al. 2002; Vieille and Zeikus 2001; Bertoldo et al. 2004; and/or Malherbe and Cloete 2002.

Embodiments of methods include placing a recombinant, purified, and/or isolated polypeptide selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylan, glycoside, xylan-, glucan-, galactan-, and/or mannan-decorating group.

Further embodiments of methods include placing a cell producing or encoding a recombinant, purified, and/or isolated polypeptide selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylan, glycoside, xylan-, glucan-, galactan-, and/or mannan-decorating group.
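The sequence identity thresholds recited above can be illustrated with a short calculation. The sketch below is an assumption-laden example, not the definition of identity used by the invention: it assumes the two sequences have already been aligned to equal length with "-" as the gap character, and it counts identity over aligned columns. The function names and the example sequences are hypothetical; only the threshold values are taken from the text.

```python
# Minimum percent identity per reference SEQ ID NO, as recited in the text.
# SEQ ID NOs not listed here (2, 19, 52, 69, and so on) use the 90% default.
THRESHOLDS = {462: 93.0, 36: 94.0, 460: 96.0, 464: 99.0, 458: 99.6, 456: 99.7}
DEFAULT_THRESHOLD = 90.0


def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns holding the same residue in both sequences.

    Assumes a pre-computed pairwise alignment; columns that are gaps in both
    sequences are ignored, and a gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("alignment has no usable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_query, aligned_reference, seq_id_no):
    """True if the query reaches the identity recited for the given SEQ ID NO."""
    required = THRESHOLDS.get(seq_id_no, DEFAULT_THRESHOLD)
    return percent_identity(aligned_query, aligned_reference) >= required


# Hypothetical 10-residue fragments differing at one position.
identity = percent_identity("MKTAYIAKQR", "MKTAYIAKQK")
print(identity, meets_threshold("MKTAYIAKQR", "MKTAYIAKQK", 2))
```

Other conventions for the denominator (for example, the length of the reference sequence) give different percentages, so the convention should follow whatever definition governs the claims.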

As used herein, “partially degrading” relates to the rearrangement or cleavage of chemical bonds in the target structure.

In additional embodiments, methods of at least partially degrading, cleaving, and/or removing a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylan, glycoside, xylan-, glucan-, galactan-, and/or mannan-decorating group may take place at temperatures at or above about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, and/or 95 degrees Celsius and/or at a pH at, below, and/or above 7, 6, 5, 4, 3, 2, 1, and/or 0.

Further embodiments of the invention may comprise a kit for at least partially degrading, cleaving, and/or removing a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylan, glycoside, xylan-, glucan-, galactan-, and/or mannan-decorating group, the kit comprising a cell producing or encoding a recombinant, purified, and/or isolated polypeptide selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456 and/or a recombinant, purified, and/or isolated polypeptide selected from the group consisting of a polypeptide having at least 90% sequence identity to SEQ ID NOs:2, 19, 52, 69, 86, 102, 119, 136, 153, 168, 185, 202, 219, 236, 253, 270, 287, 304, 321, 337, 354, 371, 388, 405, 422, or 439; at least 93% sequence identity to SEQ ID NO:462; at least 94% sequence identity to SEQ ID NO:36; at least 96% sequence identity to SEQ ID NO:460; at least 99% sequence identity to SEQ ID NO:464; at least 99.6% sequence identity to SEQ ID NO:458; and at least 99.7% sequence identity to SEQ ID NO:456.

The invention is described in additional detail in the following illustrative examples. Although the examples may represent only selected embodiments of the invention, it should be understood that the following examples are illustrative and not limiting.

In embodiments of the invention, any one of the isolated and/or purified polypeptides according to the invention may be enzymatically active at temperatures at or above about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, and/or 95 degrees Celsius and/or may be enzymatically active at a pH at, below, and/or above 7, 6, 5, 4, 3, 2, 1, and/or 0. In further embodiments of the invention, glycosylation, pegylation, and/or other post-translational modification may be required for the isolated and/or purified polypeptides according to the invention to be enzymatically active at a pH at or below 7, 6, 5, 4, 3, 2, 1, and/or 0 or at a temperature at or above about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, and/or 95 degrees Celsius.

EXAMPLES

Example 1: RAAC00169: An Esterase of the Alpha-Beta Hydrolase Superfamily

Provided in SEQ ID NO:1 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:2. As can be seen in FIGS. 1A and 1B, SEQ ID NO:2 aligns well with other proteins identified as esterases of the alpha-beta hydrolase superfamily. Of particular importance, it is noted that where amino acids are conserved in other esterases of the alpha-beta hydrolase superfamily, those amino acids are generally conserved in SEQ ID NO:2. Thus, the polypeptide provided in SEQ ID NO:2 is properly classified as an esterase of the alpha-beta hydrolase superfamily.

The polypeptides of SEQ ID NOs:13-17 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:2 and are encoded by nucleotide sequences of SEQ ID NOs:8-12, respectively.

The nucleotide sequences of SEQ ID NOs:1 and 8-12 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:1 and 8-12 produce the polypeptides of SEQ ID NOs:2 and 13-17. The polypeptides of SEQ ID NOs:2 and 13-17 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:2 and 13-17 are then demonstrated to have activity as esterases.

The isolated and/or purified polypeptides of SEQ ID NOs:2 and 13-17 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:2 and 13-17 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 2: RAAC00501: An Alpha-Beta Hydrolase

Provided in SEQ ID NO:18 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:19. As can be seen in FIGS. 2A and 2B, SEQ ID NO:19 aligns well with other proteins identified as alpha-beta hydrolases. Of particular importance, it is noted that where amino acids are conserved in other alpha-beta hydrolases, those amino acids are generally conserved in SEQ ID NO:19. Thus, the polypeptide provided in SEQ ID NO:19 is properly classified as an alpha-beta hydrolase.

The polypeptides of SEQ ID NOs:30-34 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:19 and are encoded by the nucleotide sequences of SEQ ID NOs:25-29, respectively.

The nucleotide sequences of SEQ ID NOs:18 and 25-29 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:18 and 25-29 produce the polypeptides of SEQ ID NOs:19 and 30-34. The polypeptides of SEQ ID NOs:19 and 30-34 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:19 and 30-34 are then demonstrated to have activity as alpha-beta hydrolases.

The isolated and/or purified polypeptides of SEQ ID NOs:19 and 30-34 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:19 and 30-34 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 3: RAAC00568: An Alpha-Glucosidase

Provided in SEQ ID NO:35 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:36. As can be seen in FIGS. 3A, 3B, and 3C, SEQ ID NO:36 aligns well with other proteins identified as alpha-glucosidases. Of particular importance, it is noted that where amino acids are conserved in other alpha-glucosidases, those amino acids are generally conserved in SEQ ID NO:36. Thus, the polypeptide provided in SEQ ID NO:36 is properly classified as an alpha-glucosidase.

The polypeptides of SEQ ID NOs:46-50 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:36 and are encoded by nucleotide sequences of SEQ ID NOs:41-45, respectively.

The nucleotide sequences of SEQ ID NOs:35 and 41-45 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:35 and 41-45 produce the polypeptides of SEQ ID NOs:36 and 46-50. The polypeptides of SEQ ID NOs:36 and 46-50 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:36 and 46-50 are then demonstrated to have activity as alpha-glucosidases.

The isolated and/or purified polypeptides of SEQ ID NOs:36 and 46-50 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:36 and 46-50 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 4: Production and Purification of RAAC00568: An Alpha-Glucosidase

The nucleotide sequence of SEQ ID NO:35 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:35 encodes the polypeptide of SEQ ID NO:36. SEQ ID NO:35 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and heat shock into competent cells, respectively. Expression of SEQ ID NO:36 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:35 and RAAC00568 was affinity purified using a cobalt resin from these sources for activity testing.

Example 5: Alpha-Glucosidase Activity of RAAC00568

RAAC00568 purified from P. pastoris was tested for alpha-glucosidase activity using an assay summarized as follows:

A stock solution of p-nitrophenyl α-glucopyranoside (Sigma Cat. No. N1377) was prepared by adding 90.375 mg to 10 mL of water. This stock was diluted 1:15 in 50 mM sodium acetate buffer of pH 2.0, 3.5, and 5.5.
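The concentrations implied by this preparation can be checked with a short calculation. The sketch below rests on two assumptions that are not stated in the text: a molar mass of 301.25 g/mol for p-nitrophenyl α-D-glucopyranoside (C12H15NO8), and a reading of "1:15" as one part stock in fifteen parts total volume. The function names are hypothetical.

```python
# Assumed molar mass of p-nitrophenyl alpha-D-glucopyranoside (C12H15NO8), g/mol.
PNP_GLUCOSIDE_MW = 301.25


def stock_concentration_mM(mass_mg, volume_mL, molar_mass):
    """mg divided by g/mol gives mmol; mmol per mL equals mol/L, so x1000 gives mM."""
    return mass_mg / molar_mass / volume_mL * 1000.0


def dilute(concentration, parts_total):
    """Assumes '1:N' means 1 part stock in N parts total volume."""
    return concentration / parts_total


stock_mM = stock_concentration_mM(90.375, 10.0, PNP_GLUCOSIDE_MW)
working_mM = dilute(stock_mM, 15)
print(f"stock: {stock_mM:.1f} mM, working: {working_mM:.2f} mM")
```

Under these assumptions the stock is about 30 mM and the working solution about 2 mM. If "1:15" is instead read as one part stock plus fifteen parts buffer, the working concentration would be slightly lower, about 1.9 mM.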

Samples of purified RAAC00568 generated in Example 4 were diluted 1:5, 1:10, 1:20, and 1:50 in 50 mM sodium acetate buffer of pH 2.0, 3.5, and 5.5. Samples (RAAC00568 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl α-glucopyranoside solution, preheated to 60 or 80 degrees Celsius, was then added to each well and the plate was further incubated at 60 or 80 degrees Celsius for an additional 10 minutes. One hundred μL of 2M sodium carbonate was then added to each well and the α-glucosidase activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.
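One way the plate-reader absorbance could be converted to a specific activity in μmol/min/mg is sketched below. This is an illustration under stated assumptions, not the calculation used to produce the reported values. The extinction coefficient of released p-nitrophenol under alkaline conditions (taken here as 18,000 per M per cm at 405 nm), the optical path length in the well, and all example readings and enzyme masses are assumed or hypothetical. The 300 μL final volume and the 10 minute incubation follow from the volumes and time given above (10 μL sample, 190 μL substrate, 100 μL sodium carbonate).

```python
def specific_activity(a405_sample, a405_blank, enzyme_mg,
                      minutes=10.0, final_volume_uL=300.0,
                      epsilon_per_M_cm=18000.0, path_cm=1.0):
    """Estimate specific activity in umol product per minute per mg enzyme.

    epsilon_per_M_cm and path_cm are assumptions; in practice a p-nitrophenol
    standard curve read in the same plate would replace them.
    """
    if enzyme_mg <= 0 or minutes <= 0:
        raise ValueError("enzyme mass and time must be positive")
    # Beer-Lambert law: concentration (M) = absorbance / (epsilon * path length).
    delta_a = a405_sample - a405_blank
    product_M = delta_a / (epsilon_per_M_cm * path_cm)
    # mol/L multiplied by microliters gives micromoles.
    product_umol = product_M * final_volume_uL
    return product_umol / (minutes * enzyme_mg)


# Hypothetical reading: blank-corrected A405 of 0.60 with 0.001 mg enzyme in the well.
activity = specific_activity(a405_sample=0.65, a405_blank=0.05, enzyme_mg=0.001)
print(f"{activity:.2f} umol/min/mg")
```

The enzyme mass in the well depends on the protein concentration of the purified preparation and the dilution used (1:5 to 1:50), neither of which is fixed by this sketch.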

Specific activity for RAAC00568 as determined appears in Table 1.

TABLE 1

ASSAY (Alpha-glucosidase)    SPECIFIC ACTIVITY (P. pastoris)
pH 3.5, 60° C.               2.5 μmol/min/mg
pH 5.5, 60° C.               1.4 μmol/min/mg
pH 3.5, 80° C.               2.8 μmol/min/mg
pH 2.0, 60° C.               2.4 μmol/min/mg

Example 6: Alpha-Xylosidase Activity of RAAC00568

RAAC00568 purified from P. pastoris was tested for alpha-xylosidase activity using an assay summarized as follows:

A solution of p-nitrophenyl α-xylopyranoside (Sigma Cat. No. N1895) was created by diluting 50 mg of p-nitrophenyl α-xylopyranoside in 2 mL methanol. Individual aliquots of this solution were then diluted 1:50 with 50 mM sodium acetate buffer of pH 2.0, 3.5, and 5.5.

Samples of purified RAAC00568 generated in Example 4 were diluted 1:5, 1:10, 1:20, and 1:50 in 50 mM sodium acetate buffer of pH 2.0, 3.5, and 5.5. Samples (RAAC00568 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl α-xylopyranoside solution, preheated to 60 or 80 degrees Celsius, was then added to each well and the plate was further incubated at 60 or 80 degrees Celsius for an additional 10 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the α-xylosidase activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.

Specific activity for RAAC00568 as determined appears in Table 2.

TABLE 2

ASSAY (Alpha-xylosidase)     SPECIFIC ACTIVITY (P. pastoris)
pH 3.5, 60° C.               2.5 μmol/min/mg
pH 5.5, 60° C.               6.2 μmol/min/mg
pH 3.5, 80° C.               14 μmol/min/mg
pH 2.0, 60° C.               1.36 μmol/min/mg

Example 7: RAAC00594

Provided in SEQ ID NO:51 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:52. As can be seen in FIGS. 4A, 4B, and 4C, SEQ ID NO:52 aligns well with other proteins identified as alpha-xylosidases. Of particular importance, it is noted that where amino acids are conserved in other alpha-xylosidases, those amino acids are generally conserved in SEQ ID NO:52.

The polypeptides of SEQ ID NOs:63-67 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:52 and are encoded by nucleotide sequences of SEQ ID NOs:58-62, respectively.

The nucleotide sequences of SEQ ID NOs:51 and 58-62 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:51 and 58-62 produce the polypeptides of SEQ ID NOs:52 and 63-67. The polypeptides of SEQ ID NOs:52 and 63-67 are then isolated and/or purified.

Example 8: Production and Purification of RAAC00594

The nucleotide sequence of SEQ ID NO:51 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:51 encodes the polypeptide of SEQ ID NO:52. SEQ ID NO:51 was cloned into the pBAD/HIS A expression vector for E. coli and provided to E. coli via electroporation into competent cells. Expression of SEQ ID NO:52 was detected from transformed E. coli comprising SEQ ID NO:51, and RAAC00594 was affinity purified using a cobalt resin from this source for activity testing.

Example 9: RAAC00602: An Alpha-L-Arabinofuranosidase

Provided in SEQ ID NO:68 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:69. As can be seen in FIGS. 5A and 5B, SEQ ID NO:69 aligns well with other proteins identified as alpha-L-arabinofuranosidases. Of particular importance, it is noted that where amino acids are conserved in other alpha-L-arabinofuranosidases, those amino acids are generally conserved in SEQ ID NO:69. Thus, the polypeptide provided in SEQ ID NO:69 is properly classified as an alpha-L-arabinofuranosidase.

The polypeptides of SEQ ID NOs:80-84 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:69 and are encoded by nucleotide sequences of SEQ ID NOs:75-79, respectively.

The nucleotide sequences of SEQ ID NOs:68 and 75-79 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:68 and 75-79 produce the polypeptides of SEQ ID NOs:69 and 80-84. The polypeptides of SEQ ID NOs:69 and 80-84 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:69 and 80-84 are then demonstrated to have activity as alpha-L-arabinofuranosidases.

The isolated and/or purified polypeptides of SEQ ID NOs:69 and 80-84 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:69 and 80-84 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 10: Production and Purification of RAAC00602: An Alpha-L-Arabinofuranosidase

The nucleotide sequence of SEQ ID NO:68 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:68 encodes the polypeptide of SEQ ID NO:69. SEQ ID NO:68 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:69 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:68 and RAAC00602 was affinity purified using a cobalt resin from these sources for activity testing.

Example 11: Alpha-L-Arabinofuranosidase Activity of RAAC00602

RAAC00602 purified from E. coli and P. pastoris was tested for alpha-L-arabinofuranosidase activity using an assay summarized as follows:

A solution of p-nitrophenyl α-L-arabinofuranoside (Sigma Cat. No. N3641) was created by diluting 271.2 mg of p-nitrophenyl α-L-arabinofuranoside in 10 mL methanol. Individual aliquots of this solution were then diluted 1:50 in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), glycine-HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).
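The buffer assignments listed above can be expressed as a simple lookup, which may help when laying out a plate across the pH range. The sketch below is illustrative only; the function and constant names are hypothetical, and the choice to raise an error for pH values that fall between the listed ranges (for example, pH 2.5) is an assumption, since the text does not assign a buffer to them.

```python
# (buffer name, lowest pH, highest pH), as listed in the text; all at 50 mM.
BUFFERS = [
    ("maleic acid", 1.0, 2.0),
    ("glycine-HCl", 3.0, 3.0),
    ("sodium acetate", 3.5, 5.0),
    ("sodium phosphate", 6.0, 8.0),
    ("Tris-HCl", 9.0, 9.0),
]


def buffer_for_ph(ph):
    """Return the buffer listed for `ph`, or raise if the text lists none."""
    for name, low, high in BUFFERS:
        if low <= ph <= high:
            return name
    raise ValueError(f"no buffer listed for pH {ph}")


for ph in (1.0, 3.0, 3.5, 7.0, 9.0):
    print(ph, buffer_for_ph(ph))
```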

Samples of purified RAAC00602 generated in Example 10 were diluted 1:5, 1:10, 1:20, and 1:50 in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC00602 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl α-L-arabinofuranoside solution, preheated to 50, 60, 70, 80, or 90 degrees Celsius, was then added to each well and the plate was further incubated at 50, 60, 70, 80, or 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the α-L-arabinofuranosidase activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.
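One way an absorbance reading from such a plate might be converted to a specific activity in μmol/min mg is sketched below using the Beer-Lambert relationship. The final well volume of 300 μL (10 + 190 + 100 μL) and the 3 minute incubation come from the procedure above; the extinction coefficient, optical path length, enzyme mass, and absorbance value are hypothetical inputs that the text does not provide, and the function names are illustrative only.

```python
def product_umol(a405, epsilon_M_cm, path_cm, final_volume_uL):
    """Micromoles of p-nitrophenol in a well from a blank-corrected A405."""
    conc_M = a405 / (epsilon_M_cm * path_cm)  # Beer-Lambert: A = epsilon * c * l
    return conc_M * final_volume_uL           # (mol/L) * uL = umol


def specific_activity(a405, enzyme_mg, epsilon_M_cm, path_cm,
                      minutes=3.0, final_volume_uL=300.0):
    """Specific activity in umol product per minute per mg enzyme."""
    if enzyme_mg <= 0 or minutes <= 0:
        raise ValueError("enzyme_mg and minutes must be positive")
    umol = product_umol(a405, epsilon_M_cm, path_cm, final_volume_uL)
    return umol / (minutes * enzyme_mg)


# Hypothetical inputs: A405 of 0.5, 0.001 mg enzyme in the well,
# epsilon of 18000 per M per cm, and a 1 cm path length.
example = specific_activity(0.5, enzyme_mg=0.001,
                            epsilon_M_cm=18000.0, path_cm=1.0)
```

In practice the extinction coefficient and path length would be replaced by a p-nitrophenol standard curve measured in the same plate and buffer.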

Specific activity for RAAC00602 for some pH and temperature combinations appears in Table 3, while FIGS. 28 and 29 present the results for a full range of temperature and pH combinations.

TABLE 3

ASSAY                        SPECIFIC ACTIVITY     SPECIFIC ACTIVITY
α-L-arabinofuranosidase      P. pastoris           E. coli

pH 3.5 60° C.                5.54 μmol/min mg      15.2 μmol/min mg
pH 2.0 60° C.                0.1 μmol/min mg       0.07 μmol/min mg
pH 3.5 80° C.                3.53 μmol/min mg      9.77 μmol/min mg
pH 2.0 80° C.                1.46 μmol/min mg      0 μmol/min mg
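The values of Table 3 can be compared across temperatures with a short calculation. The sketch below encodes the table as reported and computes, for each expression host, the fraction of the 60° C. specific activity that remains at 80° C.; the dictionary layout and the function name `retention` are illustrative choices and not part of the described assay.

```python
# (host, pH, temperature in degrees C) -> specific activity in umol/min mg, from Table 3
TABLE_3 = {
    ("P. pastoris", 3.5, 60): 5.54, ("E. coli", 3.5, 60): 15.2,
    ("P. pastoris", 2.0, 60): 0.1,  ("E. coli", 2.0, 60): 0.07,
    ("P. pastoris", 3.5, 80): 3.53, ("E. coli", 3.5, 80): 9.77,
    ("P. pastoris", 2.0, 80): 1.46, ("E. coli", 2.0, 80): 0.0,
}


def retention(host, ph, t_low=60, t_high=80):
    """Ratio of specific activity at t_high to that at t_low for one host and pH."""
    low = TABLE_3[(host, ph, t_low)]
    if low == 0:
        raise ValueError("reference activity is zero")
    return TABLE_3[(host, ph, t_high)] / low


pastoris_ratio = retention("P. pastoris", 3.5)  # about 0.64
coli_ratio = retention("E. coli", 3.5)          # about 0.64
best_condition = max(TABLE_3, key=TABLE_3.get)
```

At pH 3.5, material from either host retains roughly 64% of its 60° C. specific activity at 80° C., and the highest tabulated value is for the E. coli material at pH 3.5 and 60° C.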

Example 12: Beta-Xylosidase Activity of RAAC00602

RAAC00602 purified from E. coli and P. pastoris was tested for beta-xylosidase activity using a fluorescent assay summarized as follows:

A solution of MUXyl (4-methylumbelliferyl β-D-xylopyranoside) (Sigma M7008-1G CAS #6734-33-4) was created by dissolving 10 mg (0.01 g) MUXyl in 1 mL dimethyl sulfoxide (DMSO). Individual aliquots of the DMSO solution were then diluted 1:100 with 50 mM sodium acetate buffer of pH 2.0 and 3.5.

Samples of purified RAAC00602 generated in Example 10 were diluted 1:5, 1:10, 1:20, and 1:50 in 50 mM sodium acetate at pH 2.0 and 3.5. β-xylosidase from A. niger (Sigma X3501-5UN CAS #9025-53-0) was diluted 1:100 in 50 mM sodium acetate at pH 2.0 and 3.5 as positive controls. Samples (RAAC00602 samples and positive controls) were placed in the wells of a 96-well plate in 50 μL aliquots. Blanks of buffer only were placed in some wells. The plate was then preheated to 60 or 80 degrees Celsius for 5 minutes. Ten μL of MUXyl solution was then added to each well and the plate was further incubated at 60 or 80 degrees Celsius for 3 minutes. One hundred μL of 0.5 M sodium carbonate was then added to each well and the β-xylosidase activity was measured in a 96-well plate reader (SpectraMAX® Gemini) at an excitation of 355 nm and an emission of 460 nm. Specific activity for RAAC00602 as determined appears in Table 4.

TABLE 4

ASSAY β-xylosidase; SPECIFIC ACTIVITY P. pastoris; SPECIFIC ACTIVITY E. coli

pH 3.5 60° C.: 2.5 μmol/min mg
pH 2.0 60° C.: 1.2 μmol/min mg
pH 2.0 80° C.: 0.7 μmol/min mg

Example 13: RAAC00798: A Cell Wall-Associated Hydrolase

Provided in SEQ ID NO:85 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:86. As can be seen in FIGS. 6A and 6B, SEQ ID NO:86 aligns well with other proteins identified as cell wall-associated hydrolases. Of particular importance, it is noted that where amino acids are conserved in other cell wall-associated hydrolases, those amino acids are generally conserved in SEQ ID NO:86. Thus, the polypeptide provided in SEQ ID NO:86 is properly classified as a cell wall-associated hydrolase.
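The comparison described here, in which residues conserved among known family members are checked against the new polypeptide, can be sketched as follows. The sketch assumes a pre-computed multiple alignment in which every row has the same length and gaps are written as "-"; the function names and the toy sequences are hypothetical and do not reproduce FIGS. 6A and 6B.

```python
def conserved_columns(references):
    """Alignment columns where every reference row has the same non-gap residue."""
    length = len(references[0])
    if any(len(row) != length for row in references):
        raise ValueError("all aligned rows must have the same length")
    first = references[0]
    return [i for i in range(length)
            if first[i] != "-" and all(row[i] == first[i] for row in references)]


def fraction_retained(query, references):
    """Fraction of reference-conserved columns where the query has the same residue."""
    if len(query) != len(references[0]):
        raise ValueError("query must be aligned to the references")
    columns = conserved_columns(references)
    if not columns:
        return 0.0
    hits = sum(1 for i in columns if query[i] == references[0][i])
    return hits / len(columns)


# Toy alignment, for illustration only.
toy_references = ["MKT-LDG", "MRT-LDG", "MKTALEG"]
toy_query = "MKTALDA"
toy_fraction = fraction_retained(toy_query, toy_references)
```

A fraction near 1.0 would correspond to the statement that conserved amino acids are generally conserved in the query; the threshold for "generally" is not specified in the text.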

The polypeptides of SEQ ID NOs:96-100 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:86 and are encoded by nucleotide sequences of SEQ ID NOs:91-95, respectively.

The nucleotide sequences of SEQ ID NOs:85 and 91-95 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:85 and 91-95 produce the polypeptides of SEQ ID NOs:86 and 96-100. The polypeptides of SEQ ID NOs:86 and 96-100 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:86 and 96-100 are then demonstrated to have activity as cell wall-associated hydrolases.

The isolated and/or purified polypeptides of SEQ ID NOs:86 and 96-100 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:86 and 96-100 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 14: RAAC01076: An Altronate Hydrolase

Provided in SEQ ID NO:101 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:102. As can be seen in FIGS. 7A and 7B, SEQ ID NO:102 aligns well with other proteins identified as altronate hydrolases. Of particular importance, it is noted that where amino acids are conserved in other altronate hydrolases, those amino acids are generally conserved in SEQ ID NO:102. Thus, the polypeptide provided in SEQ ID NO:102 is properly classified as an altronate hydrolase.

The polypeptides of SEQ ID NOs:113-117 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:102 and are encoded by nucleotide sequences of SEQ ID NOs:108-112, respectively.
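By way of illustration only, a variant polypeptide can be compared position by position with its parent to list which substitutions fall within the same physicochemical class. The grouping of amino acids used below is an assumption chosen for the sketch and may differ from the definition of a conservative substitution applied elsewhere in this disclosure; the sequences shown are hypothetical and are not drawn from the sequence listing.

```python
# Assumed residue classes: aliphatic, aromatic, hydroxyl, amide, acidic, basic,
# and three singletons. Other groupings are equally defensible.
GROUPS = ("AVLIM", "FWY", "ST", "NQ", "DE", "KRH", "G", "P", "C")
GROUP_OF = {aa: idx for idx, members in enumerate(GROUPS) for aa in members}


def classify_substitutions(reference, variant):
    """List (position, ref_aa, var_aa, kind) for every differing position (1-based)."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    changes = []
    for position, (ref_aa, var_aa) in enumerate(zip(reference, variant), start=1):
        if ref_aa == var_aa:
            continue
        same_group = GROUP_OF[ref_aa] == GROUP_OF[var_aa]
        kind = "conservative" if same_group else "non-conservative"
        changes.append((position, ref_aa, var_aa, kind))
    return changes


# Hypothetical five-residue example.
example_changes = classify_substitutions("MKLDE", "MRLDQ")
```

Under the assumed classes, lysine to arginine is reported as conservative, whereas glutamate to glutamine is not.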

The nucleotide sequences of SEQ ID NOs:101 and 108-112 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:101 and 108-112 produce the polypeptides of SEQ ID NOs:102 and 113-117. The polypeptides of SEQ ID NOs:102 and 113-117 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:102 and 113-117 are then demonstrated to have activity as altronate hydrolases.

The isolated and/or purified polypeptides of SEQ ID NOs:102 and 113-117 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:102 and 113-117 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 15: RAAC04341

Provided in SEQ ID NO:118 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:119. As can be seen in FIGS. 8A and 8B, SEQ ID NO:119 aligns well with proteins identified as cellulase/endoglucanase Ms. Of particular importance, it is noted that where amino acids are conserved in other cellulase/endoglucanase Ms, those amino acids are generally conserved in SEQ ID NO:119.

The polypeptides of SEQ ID NOs:130-134 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:119 and are encoded by nucleotide sequences of SEQ ID NOs:125-129, respectively.

The nucleotide sequences of SEQ ID NOs:118 and 125-129 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:118 and 125-129 produce the polypeptides of SEQ ID NOs:119 and 130-134. The polypeptides of SEQ ID NOs:119 and 130-134 are then isolated and/or purified.

The isolated and/or purified polypeptides of SEQ ID NOs:119 and 130-134 are challenged with peptides, polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:119 and 130-134 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing peptides, polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 16: Production and Purification of RAAC04341

The nucleotide sequence of SEQ ID NO:118 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:118 encodes the polypeptide of SEQ ID NO:119. SEQ ID NO:118 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:119 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:118 and RAAC04341 was affinity purified using a cobalt resin from these sources for activity testing.

Example 17: RAAC04342

Provided in SEQ ID NO:135 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:136. As can be seen in FIGS. 9A and 9B, SEQ ID NO:136 aligns well with other proteins identified as cellulase/endoglucanase Ms. Of particular importance, it is noted that where amino acids are conserved in other cellulase/endoglucanase Ms, those amino acids are generally conserved in SEQ ID NO:136.

The polypeptides of SEQ ID NOs:147-151 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:136 and are encoded by the nucleotide sequences of SEQ ID NOs:142-146, respectively.

The nucleotide sequences of SEQ ID NOs:135 and 142-146 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:135 and 142-146 produce the polypeptides of SEQ ID NOs:136 and 147-151. The polypeptides of SEQ ID NOs:136 and 147-151 are then isolated and/or purified.

The isolated and/or purified polypeptides of SEQ ID NOs:136 and 147-151 are challenged with peptides, polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:136 and 147-151 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing peptides, polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 18: Production and Purification of RAAC04342

The nucleotide sequence of SEQ ID NO:135 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:135 encodes the polypeptide of SEQ ID NO:136. SEQ ID NO:135 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:136 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:135 and RAAC04342 was affinity purified using a cobalt resin from these sources for activity testing.

Example 19: RAAC04343: A Cellulase/Endoglucanase M

Provided in SEQ ID NO:152 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:153. As can be seen in FIGS. 10A and 10B, SEQ ID NO:153 aligns well with other proteins identified as cellulase/endoglucanase Ms. Of particular importance, it is noted that where amino acids are conserved in other cellulase/endoglucanase Ms, those amino acids are generally conserved in SEQ ID NO:153. Thus, the polypeptide provided in SEQ ID NO:153 is properly classified as a cellulase/endoglucanase M.

The polypeptides of SEQ ID NOs:162-166 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:153 and are encoded by the nucleotide sequences of SEQ ID NOs:157-161, respectively.

The nucleotide sequences of SEQ ID NOs:152 and 157-161 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:152 and 157-161 produce the polypeptides of SEQ ID NOs:153 and 162-166. The polypeptides of SEQ ID NOs:153 and 162-166 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:153 and 162-166 are then demonstrated to have activity as cellulase/endoglucanase Ms.

The isolated and/or purified polypeptides of SEQ ID NOs:153 and 162-166 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:153 and 162-166 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 20: Production of RAAC04343

The nucleotide sequence of SEQ ID NO:152 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:152 encodes the polypeptide of SEQ ID NO:153. SEQ ID NO:152 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:153 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:152.

Example 21: RAAC01275: A Polygalacturonase

Provided in SEQ ID NO:167 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:168. As can be seen in FIGS. 11A-11C, SEQ ID NO:168 aligns well with other proteins identified as polygalacturonases. Of particular importance, it is noted that where amino acids are conserved in other polygalacturonases, those amino acids are generally conserved in SEQ ID NO:168. Thus, the polypeptide provided in SEQ ID NO:168 is properly classified as a polygalacturonase.

The polypeptides of SEQ ID NOs:179-183 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:168 and are encoded by the nucleotide sequences of SEQ ID NOs:174-178, respectively.

The nucleotide sequences of SEQ ID NOs:167 and 174-178 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:167 and 174-178 produce the polypeptides of SEQ ID NOs:168 and 179-183. The polypeptides of SEQ ID NOs:168 and 179-183 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:168 and 179-183 are then demonstrated to have activity as polygalacturonases.

The isolated and/or purified polypeptides of SEQ ID NOs:168 and 179-183 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:168 and 179-183 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 22: RAAC01615: An Alpha-Galactosidase

Provided in SEQ ID NO:184 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:185. As can be seen in FIGS. 12A-12C, SEQ ID NO:185 aligns well with other proteins identified as alpha-galactosidases. Of particular importance, it is noted that where amino acids are conserved in other alpha-galactosidases, those amino acids are generally conserved in SEQ ID NO:185. Thus, the polypeptide provided in SEQ ID NO:185 is properly classified as an alpha-galactosidase.

The polypeptides of SEQ ID NOs:196-200 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:185 and are encoded by the nucleotide sequences of SEQ ID NOs:191-195, respectively.

The nucleotide sequences of SEQ ID NOs:184 and 191-195 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:184 and 191-195 produce the polypeptides of SEQ ID NOs:185 and 196-200. The polypeptides of SEQ ID NOs:185 and 196-200 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:185 and 196-200 are then demonstrated to have activity as alpha-galactosidases.

The isolated and/or purified polypeptides of SEQ ID NOs:185 and 196-200 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:185 and 196-200 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 23: RAAC01621: A Cellobiose Phosphorylase

Provided in SEQ ID NO:201 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:202. As can be seen in FIGS. 13A-13K, SEQ ID NO:202 aligns well with other proteins identified as cellobiose phosphorylases. Of particular importance, it is noted that where amino acids are conserved in other cellobiose phosphorylases, those amino acids are generally conserved in SEQ ID NO:202. Thus, the polypeptide provided in SEQ ID NO:202 is properly classified as a cellobiose phosphorylase.

The polypeptides of SEQ ID NOs:213-217 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:202 and are encoded by the nucleotide sequences of SEQ ID NOs:208-212, respectively.

The nucleotide sequences of SEQ ID NOs:201 and 208-212 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:201 and 208-212 produce the polypeptides of SEQ ID NOs:202 and 213-217. The polypeptides of SEQ ID NOs:202 and 213-217 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:202 and 213-217 are then demonstrated to have activity as cellobiose phosphorylases.

The isolated and/or purified polypeptides of SEQ ID NOs:202 and 213-217 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:202 and 213-217 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 24: RAAC01755: An Alpha-Glucosidase

Provided in SEQ ID NO:218 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:219. As can be seen in FIGS. 14A-14C, SEQ ID NO:219 aligns well with proteins identified as glycogen debranching enzymes. Of particular importance, it is noted that where amino acids are conserved in other glycogen debranching enzymes, those amino acids are generally conserved in SEQ ID NO:219.

The polypeptides of SEQ ID NOs:230-234 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:219 and are encoded by the nucleotide sequences of SEQ ID NOs:225-229, respectively.

The nucleotide sequences of SEQ ID NOs:218 and 225-229 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:218 and 225-229 produce the polypeptides of SEQ ID NOs:219 and 230-234. The polypeptides of SEQ ID NOs:219 and 230-234 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:219 and 230-234 are then demonstrated to have activity as alpha-glucosidases.

The isolated and/or purified polypeptides of SEQ ID NOs:219 and 230-234 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:219 and 230-234 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 25: Production and Purification of RAAC01755

The nucleotide sequence of SEQ ID NO:218 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:218 encodes the polypeptide of SEQ ID NO:219. SEQ ID NO:218 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:219 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:218 and RAAC01755 was affinity purified using a cobalt resin from these sources for activity testing.

Example 26: RAAC01887: A Cellulase/Endoglucanase M

Provided in SEQ ID NO:235 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:236. As can be seen in FIGS. 15A and 15B, SEQ ID NO:236 aligns well with other proteins identified as cellulase/endoglucanase Ms. Of particular importance, it is noted that where amino acids are conserved in other cellulase/endoglucanase Ms, those amino acids are generally conserved in SEQ ID NO:236. Thus, the polypeptide provided in SEQ ID NO:236 is properly classified as a cellulase/endoglucanase M.

The polypeptides of SEQ ID NOs:247-251 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:236 and are encoded by the nucleotide sequences of SEQ ID NOs:242-246, respectively.

The nucleotide sequences of SEQ ID NOs:235 and 242-246 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:235 and 242-246 produce the polypeptides of SEQ ID NOs:236 and 247-251. The polypeptides of SEQ ID NOs:236 and 247-251 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:236 and 247-251 are then demonstrated to have activity as cellulase/endoglucanase Ms.

The isolated and/or purified polypeptides of SEQ ID NOs:236 and 247-251 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:236 and 247-251 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 27: Production of RAAC01887

The nucleotide sequence of SEQ ID NO:235 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:235 encodes the polypeptide of SEQ ID NO:236. SEQ ID NO:235 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:236 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:235.

Example 28: RAAC01897: An Acetyl Esterase/Acetyl Hydrolase

Provided in SEQ ID NO:252 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:253. As can be seen in FIGS. 16A and 16B, SEQ ID NO:253 aligns well with other proteins identified as acetyl esterase/acetyl hydrolases. Of particular importance, it is noted that where amino acids are conserved in other acetyl esterase/acetyl hydrolases, those amino acids are generally conserved in SEQ ID NO:253. Thus, the polypeptide provided in SEQ ID NO:253 is properly classified as an acetyl esterase/acetyl hydrolase.

The polypeptides of SEQ ID NOs:264-268 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:253 and are encoded by the nucleotide sequences of SEQ ID NOs:259-263, respectively.

The nucleotide sequences of SEQ ID NOs:252 and 259-263 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:252 and 259-263 produce the polypeptides of SEQ ID NOs:253 and 264-268. The polypeptides of SEQ ID NOs:253 and 264-268 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:253 and 264-268 are then demonstrated to have activity as acetyl esterase/acetyl hydrolases.

The isolated and/or purified polypeptides of SEQ ID NOs:253 and 264-268 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:253 and 264-268 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 29: RAAC01917: A Beta-1,4-Xylanase

Provided in SEQ ID NO:269 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:270. As can be seen in FIGS. 17A and 17B, SEQ ID NO:270 aligns well with other proteins identified as beta-1,4-xylanases. Of particular importance, it is noted that where amino acids are conserved in other beta-1,4-xylanases, those amino acids are generally conserved in SEQ ID NO:270. Thus, the polypeptide provided in SEQ ID NO:270 is properly classified as a beta-1,4-xylanase.

The polypeptides of SEQ ID NOs:281-285 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:270 and are encoded by the nucleotide sequences of SEQ ID NOs:276-280, respectively.

The nucleotide sequences of SEQ ID NOs:269 and 276-280 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:269 and 276-280 produce the polypeptides of SEQ ID NOs:270 and 281-285. The polypeptides of SEQ ID NOs:270 and 281-285 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:270 and 281-285 are then demonstrated to have activity as beta-1,4-xylanases.

The isolated and/or purified polypeptides of SEQ ID NOs:270 and 281-285 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:270 and 281-285 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 30: Production of RAAC01917

The nucleotide sequence of SEQ ID NO:269 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:269 encodes the polypeptide of SEQ ID NO:270. SEQ ID NO:269 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells, respectively. Expression of SEQ ID NO:270 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:269.

Example 31: 1,4-β-Glucan Cellobiohydrolase (CBH) Activity of RAAC01917

RAAC01917 purified from E. coli was tested for CBH activity using an assay summarized as follows:

A solution of p-nitrophenyl β-D-cellobioside was created by dissolving 85 mg of p-nitrophenyl β-D-cellobioside in 10 mL water. Individual aliquots of this solution were then diluted 1:9.2 in an appropriate buffer at 50 mM for pHs ranging from 1 to 10. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), Tris-HCl (pH 9.0), and CAPS buffer (pH 10.0).
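The 1:9.2 dilution can be related to a working substrate concentration with the short calculation below. It assumes a molecular weight of 463.39 g/mol for p-nitrophenyl β-D-cellobioside (C18H25NO13) and reads "1:9.2" as one part stock in 9.2 parts total; both are assumptions not stated in the text, and the 2 mM target is used only to show how such a factor could be derived.

```python
PNP_CELLOBIOSIDE_MW = 463.39  # g/mol; assumed value for C18H25NO13

# 85 mg in 10 mL water: mg / (g/mol) = mmol; mmol / mL = mol/L; times 1000 = mM
stock_mM = 85.0 / PNP_CELLOBIOSIDE_MW / 10.0 * 1000.0


def working_concentration(stock, total_parts):
    """Concentration after diluting one part stock into total_parts parts."""
    return stock / total_parts


def dilution_factor_for(stock, target):
    """Total parts needed to bring the stock down to a target concentration."""
    if target <= 0:
        raise ValueError("target must be positive")
    return stock / target


working_mM = working_concentration(stock_mM, 9.2)  # about 2 mM
factor_for_2mM = dilution_factor_for(stock_mM, 2.0)  # about 9.2
```

Under these assumptions the stock is about 18.3 mM, so a 1:9.2 dilution gives a working solution close to 2 mM.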

Samples of purified RAAC01917 generated in Example 30 were diluted 1:5, 1:10, 1:20, and 1:50 in the appropriate buffer at 50 mM for pHs ranging from 1 to 10. Samples (RAAC01917 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl β-D-cellobioside solution, preheated to 50, 60, 70, 80, or 90 degrees Celsius, was then added to each well and the plate was further incubated at 50, 60, 70, 80, or 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the CBH activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.
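The combinations of pH, buffer, temperature, and enzyme dilution described for this assay can be enumerated as sketched below. The buffer ranges, temperatures, and dilutions are taken from the procedure; the particular pH points passed to `conditions` are hypothetical, since the text gives buffer ranges rather than a list of tested values, and a pH falling between the stated ranges is treated as unsupported.

```python
from itertools import product

# (lowest pH, highest pH, buffer) as stated for the CBH assay
BUFFERS = [
    (1.0, 2.0, "maleic acid"),
    (3.0, 3.0, "Glycine HCl"),
    (3.5, 5.0, "sodium acetate"),
    (6.0, 8.0, "sodium phosphate"),
    (9.0, 9.0, "Tris-HCl"),
    (10.0, 10.0, "CAPS"),
]
TEMPERATURES_C = (50, 60, 70, 80, 90)
DILUTIONS = (5, 10, 20, 50)  # 1:5, 1:10, 1:20, 1:50


def buffer_for_ph(ph):
    """Return the buffer named in the text for a given pH."""
    for low, high, name in BUFFERS:
        if low <= ph <= high:
            return name
    raise ValueError(f"no buffer listed for pH {ph}")


def conditions(ph_values):
    """All (pH, buffer, temperature, dilution) combinations for the given pH points."""
    return [(ph, buffer_for_ph(ph), temp, dil)
            for ph, temp, dil in product(ph_values, TEMPERATURES_C, DILUTIONS)]


# Hypothetical pH points, for illustration only.
example_conditions = conditions([2.0, 3.5])
```

Each pH point yields 20 temperature and dilution combinations, before blanks and positive controls are counted.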

Specific activity for RAAC01917 as determined appears in FIG. 30.

Example 32: Endo-1,4-β-Xylanase (XYL) Activity of RAAC01917

RAAC01917 purified from E. coli was tested for XYL activity using an assay summarized as follows:

A solution of wheat arabinoxylan (WAX) was created by wetting 0.5 g of WAX with 3 mL ethanol and then adding an additional 40 mL of water. Individual aliquots of this solution were then diluted in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).

Samples of purified RAAC01917 generated in Example 30 were diluted 1:5, 1:10, 1:20, and 1:50 in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC01917 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. WAX solution, preheated to 50, 60, 70, 80, or 90 degrees Celsius, was then added to each well and the plate was incubated at 50, 60, 70, 80, or 90 degrees Celsius for 10 minutes. One hundred μL of dinitrosalicylic acid solution was then added to each well and the plate was further incubated at 80 degrees Celsius for an additional 10 minutes. The xylanase activity was measured using a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 540 nm.
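Absorbance at 540 nm from a dinitrosalicylic acid assay is typically converted to an amount of reducing sugar by means of a standard curve. The Python sketch below fits a straight line to a set of standards and inverts it. The use of xylose as the standard and every numerical value in the standards are hypothetical and are included only to show the calculation; no standard curve is described in the example.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical xylose standards (micromoles per well) and their A540 readings.
standards_umol = [0.0, 0.1, 0.2, 0.3, 0.4]
standards_a540 = [0.05, 0.25, 0.45, 0.65, 0.85]

slope, intercept = fit_line(standards_umol, standards_a540)


def reducing_sugar_umol(a540):
    """Convert an A540 reading to micromoles of reducing sugar via the fitted line."""
    return (a540 - intercept) / slope


print(f"slope={slope:.3f}, intercept={intercept:.3f}")
print(f"A540 of 0.55 corresponds to {reducing_sugar_umol(0.55):.3f} umol")
```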

Specific activity for RAAC01917 as determined appears in FIG. 31.

Example 33: RAAC02404: A Cinnamoyl Ester Hydrolase

Provided in SEQ ID NO:286 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:287. As can be seen in FIGS. 18A and 18B, SEQ ID NO:287 aligns well with other proteins identified as cinnamoyl ester hydrolases. Of particular importance, it is noted that where amino acids are conserved in other cinnamoyl ester hydrolases, those amino acids are generally conserved in SEQ ID NO:287. Thus, the polypeptide provided in SEQ ID NO:287 is properly classified as a cinnamoyl ester hydrolase.

The polypeptides of SEQ ID NOs:298-302 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:287 and are encoded by the nucleotide sequences of SEQ ID NOs:293-297, respectively.
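A conservative substitution replaces an amino acid with another of similar side-chain character. The Python sketch below shows one way such substitutions might be identified between a reference sequence and a variant of equal length. The amino acid grouping is one commonly used classification and is an assumption here, as are the two short sequences, which are invented for illustration and are not drawn from the sequence listing.

```python
# One common grouping of amino acids by side-chain properties (an assumption;
# other groupings and substitution matrices are also in use).
GROUPS = [
    set("AVLIM"),  # aliphatic / hydrophobic
    set("FWY"),    # aromatic
    set("ST"),     # hydroxyl-containing
    set("NQ"),     # amide
    set("DE"),     # acidic
    set("KRH"),    # basic
]


def is_conservative(a, b):
    """True if a and b are different residues belonging to the same group."""
    return a != b and any(a in group and b in group for group in GROUPS)


def classify_substitutions(reference, variant):
    """List (position, ref_residue, var_residue, conservative) for each difference."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        (pos, a, b, is_conservative(a, b))
        for pos, (a, b) in enumerate(zip(reference, variant), start=1)
        if a != b
    ]


# Invented toy sequences, not taken from any SEQ ID NO.
print(classify_substitutions("MKLVDE", "MRLIDQ"))
```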

The nucleotide sequences of SEQ ID NOs:286 and 293-297 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:286 and 293-297 produce the polypeptides of SEQ ID NOs:287 and 298-302. The polypeptides of SEQ ID NOs:287 and 298-302 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:287 and 298-302 are then demonstrated to have activity as cinnamoyl ester hydrolases.

The isolated and/or purified polypeptides of SEQ ID NOs:287 and 298-302 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:287 and 298-302 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 34: RAAC02424: A Carboxylesterase Type B

Provided in SEQ ID NO:303 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:304. As can be seen in FIGS. 19A and 19B, SEQ ID NO:304 aligns well with other proteins identified as carboxylesterase type Bs. Of particular importance, it is noted that where amino acids are conserved in other carboxylesterase type Bs, those amino acids are generally conserved in SEQ ID NO:304. Thus, the polypeptide provided in SEQ ID NO:304 is properly classified as a carboxylesterase type B.

The polypeptides of SEQ ID NOs:315-319 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:304 and are encoded by the nucleotide sequences of SEQ ID NOs:310-314, respectively.

The nucleotide sequences of SEQ ID NOs:303 and 310-314 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:303 and 310-314 produce the polypeptides of SEQ ID NOs:304 and 315-319. The polypeptides of SEQ ID NOs:304 and 315-319 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:304 and 315-319 are then demonstrated to have activity as carboxylesterase type Bs.

The isolated and/or purified polypeptides of SEQ ID NOs:304 and 315-319 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:304 and 315-319 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 35: Production and Purification of RAAC02424

The nucleotide sequence of SEQ ID NO:303 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:303 encodes the polypeptide of SEQ ID NO:304. SEQ ID NO:303 was cloned into the pBAD/HIS A expression vector for E. coli and provided to E. coli via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:304 was detected from transformed E. coli comprising SEQ ID NO:303, and RAAC02424 was affinity purified using a cobalt resin for activity testing.

Example 36: RAAC02616: A Beta Galactosidase/Beta-Glucuronidase

Provided in SEQ ID NO:320 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:321. As can be seen in FIGS. 20A-20D, SEQ ID NO:321 aligns well with other proteins identified as beta galactosidase/beta-glucuronidases. Of particular importance, it is noted that where amino acids are conserved in other beta galactosidase/beta-glucuronidases, those amino acids are generally conserved in SEQ ID NO:321. Thus, the polypeptide provided in SEQ ID NO:321 is properly classified as a beta galactosidase/beta-glucuronidase.

The polypeptides of SEQ ID NOs:331-335 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:321 and are encoded by the nucleotide sequences of SEQ ID NOs:326-330, respectively.

The nucleotide sequences of SEQ ID NOs:320 and 326-330 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:320 and 326-330 produce the polypeptides of SEQ ID NOs:321 and 331-335. The polypeptides of SEQ ID NOs:321 and 331-335 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:321 and 331-335 are then demonstrated to have activity as beta galactosidase/beta-glucuronidases.

The isolated and/or purified polypeptides of SEQ ID NOs:321 and 331-335 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:321 and 331-335 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 37: RAAC02661: A Xylan Alpha-1,2-Glucuronidase

Provided in SEQ ID NO:336 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:337. As can be seen in FIGS. 21A-21D, SEQ ID NO:337 aligns well with other proteins identified as xylan alpha-1,2-glucuronidases. Of particular importance, it is noted that where amino acids are conserved in other xylan alpha-1,2-glucuronidases, those amino acids are generally conserved in SEQ ID NO:337. Thus, the polypeptide provided in SEQ ID NO:337 is properly classified as a xylan alpha-1,2-glucuronidase.

The polypeptides of SEQ ID NOs:348-352 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:337 and are encoded by the nucleotide sequences of SEQ ID NOs:343-347, respectively.

The nucleotide sequences of SEQ ID NOs:336 and 343-347 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:336 and 343-347 produce the polypeptides of SEQ ID NOs:337 and 348-352. The polypeptides of SEQ ID NOs:337 and 348-352 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:337 and 348-352 are then demonstrated to have activity as xylan alpha-1,2-glucuronidases.

The isolated and/or purified polypeptides of SEQ ID NOs:337 and 348-352 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:337 and 348-352 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 38: Production and Purification of RAAC02661

The nucleotide sequence of SEQ ID NO:336 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:336 encodes the polypeptide of SEQ ID NO:337. SEQ ID NO:336 was cloned into the pBAD/HIS A expression vector for E. coli and provided to E. coli via electroporation. Expression of SEQ ID NO:337 was detected from transformed E. coli comprising SEQ ID NO:336, and RAAC02661 was affinity purified using a cobalt resin for activity testing.

Example 39: α-Glucuronidase (AGUR) Activity of RAAC02661

RAAC02661 purified from E. coli was tested for AGUR activity using an assay summarized as follows:

A solution of aldouronic acids (AUAs) was created by diluting 50 μL of a mixture of aldotetraouronic acid, aldotriouronic acid and aldobiouronic acid (40:40:20; Aldouronic Acid Mixture, Megazyme Cat. No. O-AMX) with 1.95 mL of an appropriate buffer at 50 mM for pHs ranging from 1 to 10. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), Tris-HCl (pH 9.0), and CAPS buffer (pH 10.0).

Samples of purified RAAC02661 generated in Example 38 were diluted to an appropriate concentration for activity measurement in the appropriate buffer at 50 mM for pHs ranging from 1 to 10. Samples (RAAC02661 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. AUA solution, preheated to 50, 60, 70, 80, or 90 degrees Celsius, was then added to each well and the plate was incubated at 50, 60, 70, 80, or 90 degrees Celsius for 3 minutes. Dinitrosalicylic acid solution was then added to each well and the plate was further incubated at 80 degrees Celsius for an additional 10 minutes. The AGUR activity was measured using a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 540 nm. Specific activity for RAAC02661 as determined appears in FIG. 32.

Example 40: RAAC02925: A 3-Hydroxyisobutyryl-CoA Hydrolase

Provided in SEQ ID NO:353 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:354. As can be seen in FIGS. 22A-22C, SEQ ID NO:354 aligns well with other proteins identified as 3-hydroxyisobutyryl-CoA hydrolases. Of particular importance, it is noted that where amino acids are conserved in other 3-hydroxyisobutyryl-CoA hydrolases, those amino acids are generally conserved in SEQ ID NO:354. Thus, the polypeptide provided in SEQ ID NO:354 is properly classified as a 3-hydroxyisobutyryl-CoA hydrolase.

The polypeptides of SEQ ID NOs:365-369 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:354 and are encoded by the nucleotide sequences of SEQ ID NOs:360-364, respectively.

The nucleotide sequences of SEQ ID NOs:353 and 360-364 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:353 and 360-364 produce the polypeptides of SEQ ID NOs:354 and 365-369. The polypeptides of SEQ ID NOs:354 and 365-369 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:354 and 365-369 are then demonstrated to have activity as 3-hydroxyisobutyryl-CoA hydrolases.

The isolated and/or purified polypeptides of SEQ ID NOs:354 and 365-369 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:354 and 365-369 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 41: RAAC03001: A Beta-Glucosidase

Provided in SEQ ID NO:370 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:371. As can be seen in FIGS. 23A-23D, SEQ ID NO:371 aligns well with other proteins identified as beta-glucosidases. Of particular importance, it is noted that where amino acids are conserved in other beta-glucosidases, those amino acids are generally conserved in SEQ ID NO:371. Thus, the polypeptide provided in SEQ ID NO:371 is properly classified as a beta-glucosidase.

The polypeptides of SEQ ID NOs:382-386 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:371 and are encoded by the nucleotide sequences of SEQ ID NOs:377-381, respectively.

The nucleotide sequences of SEQ ID NOs:370 and 377-381 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:370 and 377-381 produce the polypeptides of SEQ ID NOs:371 and 382-386. The polypeptides of SEQ ID NOs:371 and 382-386 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:371 and 382-386 are then demonstrated to have activity as beta-glucosidases.

The isolated and/or purified polypeptides of SEQ ID NOs:371 and 382-386 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:371 and 382-386 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 42: Production and Purification of RAAC03001: A Beta-Glucosidase

The nucleotide sequence of SEQ ID NO:370 was cloned from Alicyclobacillus acidocaldarius. SEQ ID NO:370 encodes the polypeptide of SEQ ID NO:371. SEQ ID NO:370 was cloned into the pBAD/HIS A expression vector for E. coli and the pPIC6α A expression vector for P. pastoris and provided to E. coli and P. pastoris via electroporation and/or heat shock into competent cells. Expression of SEQ ID NO:371 was detected from both transformed E. coli and P. pastoris comprising SEQ ID NO:370, and RAAC03001 was affinity purified using a cobalt resin from these sources for activity testing.

Example 43: Beta-Glucosidase Activity of RAAC03001

RAAC03001 purified from E. coli was tested for beta-glucosidase activity using the assay summarized as follows: A solution of p-nitrophenyl β-D-glucopyranoside (Sigma Cat. No. N7006) was created by dissolving 301.25 mg of p-nitrophenyl β-D-glucopyranoside in 20 mL water. Individual aliquots of this solution were then diluted 1:25 in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).

Samples of purified RAAC03001 generated in Example 42 were diluted to an appropriate concentration for activity measurement in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC03001 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl β-D-glucopyranoside solution, preheated to temperatures ranging from 50 to 90 degrees Celsius, was then added to each well and the plate was further incubated at temperatures ranging from 50 to 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the beta-glucosidase activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.
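The substrate quantities given above imply particular molar concentrations, which the following Python sketch works through. The molar mass of p-nitrophenyl β-D-glucopyranoside (301.25 g/mol, C12H15NO8) is a known value and is not stated in the example. The reading of "1:25" as one part in 25 parts total is an assumption, and the variable names are illustrative.

```python
def stock_concentration_mM(mass_mg, volume_mL, molar_mass_g_per_mol):
    """mg/mL equals g/L; dividing by g/mol gives mol/L, and x1000 gives mM."""
    return mass_mg / volume_mL / molar_mass_g_per_mol * 1000.0


PNP_GLUCOPYRANOSIDE_G_PER_MOL = 301.25  # C12H15NO8

# 301.25 mg dissolved in 20 mL water.
stock_mM = stock_concentration_mM(301.25, 20.0, PNP_GLUCOPYRANOSIDE_G_PER_MOL)

# Diluted 1:25 in buffer (assumed to mean 1 part in 25 total).
working_mM = stock_mM / 25.0

# 190 uL of working solution added to a 10 uL sample in each well.
in_well_mM = working_mM * 190.0 / (190.0 + 10.0)
substrate_nmol_per_well = working_mM * 190.0  # mM * uL = nmol

print(f"stock {stock_mM:.1f} mM, working {working_mM:.2f} mM, "
      f"in well {in_well_mM:.2f} mM ({substrate_nmol_per_well:.0f} nmol)")
```

Under these assumptions the stock is about 50 mM, the working solution about 2 mM, and each well holds about 380 nmol of substrate at about 1.9 mM before the sodium carbonate is added.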

The results of the above assay are presented in FIG. 33 and demonstrate that the RAAC03001 protein isolated from E. coli had a range of beta-glucosidase activity at a variety of temperature and pH combinations.

Example 44: α-L-Arabinofuranosidase (AFS) Activity of RAAC03001

RAAC03001 purified from E. coli was tested for AFS activity using the assay summarized as follows: A solution of p-nitrophenyl α-L-arabinofuranoside was created by dissolving 271.22 mg of p-nitrophenyl α-L-arabinofuranoside in 10 mL methanol. Individual aliquots of this solution were then diluted 1:50 in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).

Samples of purified RAAC03001 generated in Example 42 were diluted to an appropriate concentration for activity measurement in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC03001 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl α-L-arabinofuranoside solution, preheated to temperatures ranging from 50 to 90 degrees Celsius, was then added to each well and the plate was further incubated at temperatures ranging from 50 to 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the AFS activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.

The results of the above assay are presented in FIG. 34 and demonstrate that the RAAC03001 protein isolated from E. coli had a range of AFS activity at a variety of temperature and pH combinations.

Example 45: β-Galactosidase (BGAL) Activity of RAAC03001

RAAC03001 purified from E. coli was tested for BGAL activity using the assay summarized as follows: A solution of p-nitrophenyl β-D-galactopyranoside was created by dissolving 30.13 mg of p-nitrophenyl β-D-galactopyranoside in 10 mL buffer. Individual aliquots of this solution were then diluted 1:5 in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).

Samples of purified RAAC03001 generated in Example 42 were diluted to an appropriate concentration for activity measurement in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC03001 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl β-D-galactopyranoside solution, preheated to temperatures ranging from 50 to 90 degrees Celsius, was then added to each well and the plate was further incubated at temperatures ranging from 50 to 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the BGAL activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.

The results of the above assay are presented in FIG. 35 and demonstrate that the RAAC03001 protein isolated from E. coli had a range of BGAL activity at a variety of temperature and pH combinations.

Example 46: β-Xylosidase (BXYL) Activity of RAAC03001

RAAC03001 purified from E. coli was tested for BXYL activity using the assay summarized as follows: A solution of p-nitrophenyl β-D-xylopyranoside was created by dissolving 271.22 mg of p-nitrophenyl β-D-xylopyranoside in 10 mL methanol. Individual aliquots of this solution were then diluted 1:50 in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).

Samples of purified RAAC03001 generated in Example 42 were diluted to an appropriate concentration for activity measurement in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC03001 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl β-D-xylopyranoside solution, preheated to temperatures ranging from 50 to 90 degrees Celsius, was then added to each well and the plate was further incubated at temperatures ranging from 50 to 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the BXYL activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.

The results of the above assay are presented in FIG. 36 and demonstrate that the RAAC03001 protein isolated from E. coli had a range of BXYL activity at a variety of temperature and pH combinations.

Example 47: 1,4-β-Glucan Cellobiohydrolase (CBH) Activity of RAAC03001

RAAC03001 purified from E. coli was tested for CBH activity using the assay summarized as follows: A solution of p-nitrophenyl β-D-cellobioside was created by dissolving 85 mg of p-nitrophenyl β-D-cellobioside in 10 mL water. Individual aliquots of this solution were then diluted 1:9.2 in an appropriate buffer at 50 mM for pHs ranging from 1 to 9. Buffers included maleic acid (pH 1.0-2.0), Glycine HCl (pH 3.0), sodium acetate (pH 3.5-5.0), sodium phosphate (pH 6.0-8.0), and Tris-HCl (pH 9.0).
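The non-round dilution factor of 1:9.2 can be examined by working backward from the stock concentration. The Python sketch below does so using the molar mass of p-nitrophenyl β-D-cellobioside (463.39 g/mol, C18H25NO13), which is a known value not stated in the example. The target working concentration of 2 mM is an assumption made here because it would account for the factor; the example itself does not state a target concentration. "1:9.2" is read as one part in 9.2 parts total.

```python
PNP_CELLOBIOSIDE_G_PER_MOL = 463.39  # C18H25NO13

# 85 mg dissolved in 10 mL water: mg/mL = g/L, then / (g/mol) * 1000 gives mM.
stock_mM = 85.0 / 10.0 / PNP_CELLOBIOSIDE_G_PER_MOL * 1000.0

TARGET_MM = 2.0  # assumed target; not stated in the example

# Dilution factor that would give the assumed target, and what 1:9.2 actually gives.
required_factor = stock_mM / TARGET_MM
actual_working_mM = stock_mM / 9.2

print(f"stock: {stock_mM:.2f} mM")
print(f"factor needed for {TARGET_MM} mM: {required_factor:.2f}")
print(f"working concentration at 1:9.2: {actual_working_mM:.3f} mM")
```

On these assumptions the stock is about 18.3 mM, so a factor of about 9.2 brings the working solution to roughly 2 mM.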

Samples of purified RAAC03001 generated in Example 42 were diluted to an appropriate concentration for activity measurement in the appropriate buffer at 50 mM for pHs ranging from 1 to 9. Samples (RAAC03001 samples and positive controls) were placed in the wells of a 96-well plate in 10 μL aliquots. Blanks of buffer only were placed in some wells. One hundred ninety μL of p-nitrophenyl β-D-cellobioside solution, preheated to temperatures ranging from 50 to 90 degrees Celsius, was then added to each well and the plate was further incubated at temperatures ranging from 50 to 90 degrees Celsius for 3 minutes. One hundred μL of 2.0 M sodium carbonate was then added to each well and the CBH activity was measured in a 96-well plate reader (Molecular Devices UV-Vis) at a wavelength of 405 nm.

The results of the above assay are presented in FIG. 37 and demonstrate that the RAAC03001 protein isolated from E. coli had a range of CBH activity at a variety of temperature and pH combinations.

Example 48: RAAC02913: A Chitooligosaccharide Deacetylase

Provided in SEQ ID NO:387 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:388. As can be seen in FIGS. 24A and 24B, SEQ ID NO:388 aligns well with other proteins identified as chitooligosaccharide deacetylases. Of particular importance, it is noted that where amino acids are conserved in other chitooligosaccharide deacetylases, those amino acids are generally conserved in SEQ ID NO:388. Thus, the polypeptide provided in SEQ ID NO:388 is properly classified as a chitooligosaccharide deacetylase.

The polypeptides of SEQ ID NOs:399-403 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:388 and are encoded by the nucleotide sequences of SEQ ID NOs:394-398, respectively.

The nucleotide sequences of SEQ ID NOs:387 and 394-398 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:387 and 394-398 produce the polypeptides of SEQ ID NOs:388 and 399-403. The polypeptides of SEQ ID NOs:388 and 399-403 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:388 and 399-403 are then demonstrated to have activity as chitooligosaccharide deacetylases.

The isolated and/or purified polypeptides of SEQ ID NOs:388 and 399-403 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:388 and 399-403 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 49: RAAC02839: A Chitooligosaccharide Deacetylase

Provided in SEQ ID NO:404 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:405. As can be seen in FIGS. 25A and 25B, SEQ ID NO:405 aligns well with other proteins identified as chitooligosaccharide deacetylases. Of particular importance, it is noted that where amino acids are conserved in other chitooligosaccharide deacetylases, those amino acids are generally conserved in SEQ ID NO:405. Thus, the polypeptide provided in SEQ ID NO:405 is properly classified as a chitooligosaccharide deacetylase.

The polypeptides of SEQ ID NOs:416-420 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:405 and are encoded by the nucleotide sequences of SEQ ID NOs:411-415, respectively.

The nucleotide sequences of SEQ ID NOs:404 and 411-415 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:404 and 411-415 produce the polypeptides of SEQ ID NOs:405 and 416-420. The polypeptides of SEQ ID NOs:405 and 416-420 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:405 and 416-420 are then demonstrated to have activity as chitooligosaccharide deacetylases.

The isolated and/or purified polypeptides of SEQ ID NOs:405 and 416-420 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:405 and 416-420 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 50: RAAC00961: A Chitooligosaccharide Deacetylase

Provided in SEQ ID NO:421 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:422. As can be seen in FIGS. 26A-26C, SEQ ID NO:422 aligns well with other proteins identified as chitooligosaccharide deacetylases. Of particular importance, it is noted that where amino acids are conserved in other chitooligosaccharide deacetylases, those amino acids are generally conserved in SEQ ID NO:422. Thus, the polypeptide provided in SEQ ID NO:422 is properly classified as a chitooligosaccharide deacetylase.

The polypeptides of SEQ ID NOs:433-437 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:422 and are encoded by the nucleotide sequences of SEQ ID NOs:428-432, respectively.

The nucleotide sequences of SEQ ID NOs:421 and 428-432 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:421 and 428-432 produce the polypeptides of SEQ ID NOs:422 and 433-437. The polypeptides of SEQ ID NOs:422 and 433-437 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:422 and 433-437 are then demonstrated to have activity as chitooligosaccharide deacetylases.

The isolated and/or purified polypeptides of SEQ ID NOs:422 and 433-437 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:422 and 433-437 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 51: RAAC00361: A Chitooligosaccharide Deacetylase

Provided in SEQ ID NO:438 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:439. As can be seen in FIGS. 27A and 27B, SEQ ID NO:439 aligns well with other proteins identified as chitooligosaccharide deacetylases. Of particular importance, it is noted that where amino acids are conserved in other chitooligosaccharide deacetylases, those amino acids are generally conserved in SEQ ID NO:439. Thus, the polypeptide provided in SEQ ID NO:439 is properly classified as a chitooligosaccharide deacetylase.

The polypeptides of SEQ ID NOs:450-454 are representative examples of conservative substitutions in the polypeptide of SEQ ID NO:439 and are encoded by the nucleotide sequences of SEQ ID NOs:445-449, respectively.

The nucleotide sequences of SEQ ID NOs:438 and 445-449 are placed into expression vectors using techniques standard in the art. The vectors are then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vectors comprising SEQ ID NOs:438 and 445-449 produce the polypeptides of SEQ ID NOs:439 and 450-454. The polypeptides of SEQ ID NOs:439 and 450-454 are then isolated and/or purified. The isolated and/or purified polypeptides of SEQ ID NOs:439 and 450-454 are then demonstrated to have activity as chitooligosaccharide deacetylases.

The isolated and/or purified polypeptides of SEQ ID NOs:439 and 450-454 are challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptides of SEQ ID NOs:439 and 450-454 are demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 52: A Glucan 1,4-Alpha-Maltohydrolase

Provided in SEQ ID NO:455 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:456. SEQ ID NO:456 aligns at about 99% identity with gi:6686566, a glucan 1,4-alpha-maltohydrolase. Thus, the polypeptide provided in SEQ ID NO:456 is properly classified as a glucan 1,4-alpha-maltohydrolase.
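A percent-identity figure such as the one above is derived from a pairwise alignment. The sketch below shows one simple way such a figure could be computed from two already-aligned sequences. The disclosure does not state which alignment tool or which denominator was used, so the choice here (matches divided by alignment columns, skipping columns that are gaps in both sequences) is an assumption, and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences.

    Assumed convention: identical residues divided by the number of
    alignment columns, ignoring columns that are gaps in both sequences.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    columns = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue  # column carries no information
        columns += 1
        if x == y:
            matches += 1
    if columns == 0:
        raise ValueError("alignment has no informative columns")
    return 100.0 * matches / columns


# Hypothetical toy alignment, not SEQ ID NO:456 or gi:6686566.
identity = percent_identity("MKT-AYIAKQ", "MKTLAYLAKQ")
```

Other conventions divide by the length of the shorter sequence or by the number of ungapped columns, which can shift the reported value by a few percent for the same alignment.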

The nucleotide sequence of SEQ ID NO:455 is placed into an expression vector using techniques standard in the art. The vector is then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vector comprising SEQ ID NO:455 produces the polypeptide of SEQ ID NO:456. The polypeptide of SEQ ID NO:456 is then isolated and/or purified. The isolated and/or purified polypeptide of SEQ ID NO:456 is then demonstrated to have activity as a glucan 1,4-alpha-maltohydrolase.

The isolated and/or purified polypeptide of SEQ ID NO:456 is then challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptide of SEQ ID NO:456 is demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 53: A Glycosidase

Provided in SEQ ID NO:457 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:458. SEQ ID NO:458 aligns at about 99% identity with gi:39301, a glycosidase. Thus, the polypeptide provided in SEQ ID NO:458 is properly classified as a glycosidase.

The nucleotide sequence of SEQ ID NO:457 is placed into an expression vector using techniques standard in the art. The vector is then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vector comprising SEQ ID NO:457 produces the polypeptide of SEQ ID NO:458. The polypeptide of SEQ ID NO:458 is then isolated and/or purified. The isolated and/or purified polypeptide of SEQ ID NO:458 is then demonstrated to have activity as a glycosidase.
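The relationship between an encoding nucleotide sequence and the polypeptide it produces can be illustrated with a short translation sketch. It assumes the standard genetic code, a coding strand that is already in frame, and translation that stops at the first stop codon; the function name and the toy input are hypothetical and are not taken from the sequence listing.

```python
from itertools import product

# Standard genetic code written in the conventional T, C, A, G codon order;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical toy coding sequence: ATG AAA GAT TAA
peptide = translate("ATGAAAGATTAA")
```

Expression in a host cell involves considerably more than codon lookup (promoters, start-site selection, codon usage, post-translational processing), so this sketch covers only the sequence-to-sequence mapping.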

The isolated and/or purified polypeptide of SEQ ID NO:458 is then challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptide of SEQ ID NO:458 is demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 54: An Acetyl Esterase

Provided in SEQ ID NO:459 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:460. SEQ ID NO:460 aligns at about 95% identity with gi:151567607, an acetyl esterase. Thus, the polypeptide provided in SEQ ID NO:460 is properly classified as an acetyl esterase.

The nucleotide sequence of SEQ ID NO:459 is placed into an expression vector using techniques standard in the art. The vector is then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vector comprising SEQ ID NO:459 produces the polypeptide of SEQ ID NO:460. The polypeptide of SEQ ID NO:460 is then isolated and/or purified. The isolated and/or purified polypeptide of SEQ ID NO:460 is then demonstrated to have activity as an acetyl esterase.

The isolated and/or purified polypeptide of SEQ ID NO:460 is then challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptide of SEQ ID NO:460 is demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 55: An Endo-Beta-1,4-Mannanase

Provided in SEQ ID NO:461 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:462. SEQ ID NO:462 aligns at about 92% identity with gi:110611196, an endo-beta-1,4-mannanase. Thus, the polypeptide provided in SEQ ID NO:462 is properly classified as an endo-beta-1,4-mannanase.

The nucleotide sequence of SEQ ID NO:461 is placed into an expression vector using techniques standard in the art. The vector is then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vector comprising SEQ ID NO:461 produces the polypeptide of SEQ ID NO:462. The polypeptide of SEQ ID NO:462 is then isolated and/or purified. The isolated and/or purified polypeptide of SEQ ID NO:462 is then demonstrated to have activity as an endo-beta-1,4-mannanase.

The isolated and/or purified polypeptide of SEQ ID NO:462 is then challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptide of SEQ ID NO:462 is demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

Example 56: A Beta-Glucosidase

Provided in SEQ ID NO:463 is a nucleotide sequence isolated from Alicyclobacillus acidocaldarius and encoding the polypeptide of SEQ ID NO:464. SEQ ID NO:464 aligns at about 92% identity with gi:110611196, a beta-glucosidase. Thus, the polypeptide provided in SEQ ID NO:464 is properly classified as a beta-glucosidase.

The nucleotide sequence of SEQ ID NO:463 is placed into an expression vector using techniques standard in the art. The vector is then provided to cells such as bacteria cells or eukaryotic cells such as Sf9 cells or CHO cells. In conjunction with the normal machinery present in the cells, the vector comprising SEQ ID NO:463 produces the polypeptide of SEQ ID NO:464. The polypeptide of SEQ ID NO:464 is then isolated and/or purified. The isolated and/or purified polypeptide of SEQ ID NO:464 is then demonstrated to have activity as a beta-glucosidase.

The isolated and/or purified polypeptide of SEQ ID NO:464 is then challenged with polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups. The isolated and/or purified polypeptide of SEQ ID NO:464 is demonstrated to have activity in at least partially degrading, cleaving, and/or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan-, glucan-, galactan-, and/or mannan-decorating groups.

All references, including publications, patents, and patent applications, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein.

While this invention has been described in certain embodiments, the present invention can be further modified within the spirit and scope of this disclosure. This application is therefore intended to cover any variations, uses, or adaptations of the invention using its general principles. Further, this application is intended to cover such departures from the present disclosure as come within known or customary practice in the art to which this invention pertains and which fall within the limits of the appended claims and their legal equivalents.

BIBLIOGRAPHIC REFERENCES

-   Barany F., 1991, PNAS, USA, 88:189-193.
-   Bertoldo et al., 2004, Eng. Life Sci., 4, No. 6.
-   Buckholz R. G., 1993, Yeast Systems for the Expression of Heterologous Gene Products, Curr. Op. Biotechnology 4:538-542.
-   Burg J. L. et al., 1996, Mol. and Cell. Probes, 10:257-271.
-   Chu B. C. F. et al., 1986, NAR, 14:5591-5603.
-   Duck P. et al., 1990, Biotechniques, 9:142-147.
-   Edwards C. P. and A. Aruffo, 1993, Current Applications of COS Cell-Based Transient Expression Systems, Curr. Op. Biotechnology 4:558-563.
-   Garrote G., H. Dominguez, and J. C. Parajo, 2001, Manufacture of Xylose-Based Fermentation Media From Corncobs by Posthydrolysis of Autohydrolysis Liquors, Appl. Biochem. Biotechnol., 95:195-207.
-   Guateli J. C. et al., 1990, PNAS, USA, 87:1874-1878.
-   Hamelinck C. N., G. van Hooijdonk, and A. P. C. Faaij, 2005, Ethanol From Lignocellulosic Biomass: Techno-Economic Performance in Short-, Middle-, and Long-Term, Biomass Bioenergy, 28:384-410.
-   Houben-Weyl, 1974, Methoden der Organischen Chemie, E. Wunsch, ed., Volume 15-I and 15-II, Thieme, Stuttgart.
-   Huygen K. et al., 1996, Nature Medicine, 2(8):893-898.
-   Innis M. A. et al., 1990, in PCR Protocols, A Guide to Methods and Applications, San Diego, Academic Press.
-   Jeffries, 1996, Curr. Op. in Biotech., 7:337-342.
-   Kievitis T. et al., 1991, J Virol. Methods, 35:273-286.
-   Kohler G. et al., 1975, Nature, 256(5517):495-497.
-   Kwoh D. Y. et al., 1989, PNAS, USA, 86:1173-1177.
-   Liu C. and C. E. Wyman, 2003, The Effect of Flow Rate of Compressed Hot Water on Xylan, Lignin, and Total Mass Removal From Corn Stover, Ind. Eng. Chem. Res., 42:5409-5416.
-   Luckow V. A., 1993, Baculovirus Systems for the Expression of Human Gene Products, Curr. Op. Biotechnology 4:564-572.
-   Lynd et al., 2002, Micro. and Mol. Biol. Rev., Vol. 66, No. 3, pp. 506-577.
-   Malherbe and Cloete, 2002, Reviews in Environmental Science and Biotechnology, 1:105-114.
-   Matthews J. A. et al., 1988, Analy. Biochem., 169:1-25.
-   Merrifield R. D., 1966, J. Am. Chem. Soc., 88(21):5051-5052.
-   Miele E. A. et al., 1983, J. Mol. Biol., 171:281-295.
-   Mielenz, 2001, Curr. Op. in Micro., 4:324-329.
-   Olins P. O. and S. C. Lee, 1993, Recent Advances in Heterologous Gene Expression in E. coli, Curr. Op. Biotechnology 4:520-525.
-   Rolfs A. et al., 1991, PCR Topics, Usage of Polymerase Chain Reaction in Genetic and Infectious Disease, Berlin: Springer-Verlag.
-   Sambrook J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press.
-   Sanchez-Pescador R., 1988, J. Clin. Microbiol., 26(10):1934-1938.
-   Segev D., 1992, “Non-radioactive Labeling and Detection of Biomolecules,” C. Kessler, ed., Springer-Verlag, Berlin, N.Y.: 197-205.
-   Shallom and Shoham, 2003, Curr. Op. in Micro., 6:219-228.
-   Tsao G. T., M. R. Ladisch, and H. R. Bungay, 1987, Biomass Refining, In Advanced Biochemical Engineering, Wiley Interscience, N.Y., 79-101.
-   Urdea M. S., 1988, Nucleic Acids Research, II: 4937-4957.
-   Vieille and Zeikus, 2001, Micro. and Mol. Biol. Rev., Vol. 65, No. 1, pp. 1-43.
-   Walker G. T. et al., 1992, NAR 20:1691-1696.
-   Walker G. T. et al., 1992, PNAS, USA, 89:392-396.
-   White B. A. et al., 1997, Methods in Molecular Biology, 67, Humana Press, Totowa, N.J.

1-6. (canceled)
 7. A method of at least partially degrading, cleaving, or removing polysaccharides, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycosides, xylan, glucan, galactan, or mannan decorating groups, the method comprising: placing a polypeptide having at least 90% sequence identity to SEQ ID NO:287 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycoside, xylan, glucan, galactan, or mannan decorating group; wherein the polypeptide has an enzymatic activity as an esterase.
 8. The method according to claim 7, wherein placing a polypeptide having at least 90% sequence identity to SEQ ID NO:287 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycoside, xylan, glucan, galactan, or mannan decorating group occurs at or below about pH 4.
 9. The method according to claim 7, wherein placing a polypeptide having at least 90% sequence identity to SEQ ID NO:287 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycoside, xylan, glucan, galactan, or mannan decorating group occurs at a temperature at or above 50 degrees Celsius.
 10. The method according to claim 7, wherein the polypeptide is glycosylated, pegylated, or otherwise posttranslationally modified.
 11. The method according to claim 7, wherein the polypeptide is encoded by a nucleic acid having at least 90% identity to SEQ ID NO:286.
 12. The method according to claim 7, wherein placing a polypeptide having at least 90% sequence identity to SEQ ID NO:287 in fluid contact with a polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycoside, xylan, glucan, galactan, or mannan decorating group comprises translating a nucleic acid having at least 90% identity to SEQ ID NO:286 in fluid contact with the polysaccharide, lignocellulose, cellulose, hemicellulose, lignin, starch, chitin, polyhydroxybutyrate, heteroxylans, glycoside, xylan, glucan, galactan, or mannan decorating group. 